FIGURE 1 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTC TGAAGACTAACAT T TTGTGAAG TTGTAAAACAGAAAACC TGT TAGAAATGT GG T GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CATACATTACT GCAGTAACACT CCACCATATAGAC CCGGC TT TACC T T AT ATCAGTGACAC TGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATT TATGT T CGTTATAAGCAAGT TCAT GC TC TGAGTCC TGAAGAGAACG T TATCATCAAA 
H TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
;J CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 

TATATATG T TTGTTCAGACCATC CT TT CCTACCAAATGCAGCCCAAAATCCATGGCAAACAAG T C 
jUJ TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
fjl AT CAGT T T T GCACAGT GGCAAT T TT GGGAC TGATTTAGAACAGAAACTCCAT TGGAACCCCGAGG 

lft ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 

GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGAT TAACCCT CTAT GACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTT TCCA 
m GAGATAT T TGAT GAAAGGATAAAAT AT TTCTGTAAT GAT TATGATTCTCAGGGAT T GGGGAAAGG 

TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GAT GAATGCTGATAAT CAGGAAACATGAAAGAAGCCATTT GATAGAT TAT T CTAAAGGAT AT CAT 
CAAGAAGACTATTAAAAACACCTAT GCCTATACTTTT TTATCTCAGAAAATAAAGTCAAAAGACT 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIAT I YVRYKQVHALS PEENV I IKLNKAGLVLGILSCLGLS I VANFQKT T L FAAHVS GAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRV5ANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



m 
in 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTT T CAT GGTGCCTTT TTACATTGGCTAT TTTATTGT GAGCAATAT CCGACTACT GCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
3 : AAC CATCAGGT TTCT GGGGAATGATAAAAAGTGTTAC CACTTCAGCATCAGGAAGTGAAAATC T TACT 

* CTT ATTCAACAGGAAGTGGATGCT TTGGAAGAATTAAGCAGGCAGCT TT TTCT GGAAACAGCTGATCT 

(3 ATAT GCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATT TTAATTT TCTT GGTT 

10 ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
W GGGAAAACGGATCCTGT CACAAGAGGCATTGAGATCACTGT GAATTATC T GGGAATCCAATTTGATGT 

^ GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 

TGCTGAT CACT CTTACCAAGTTCT TTTATGCCATCT CTAGCAGTAAGTC CTCCAATGTCATTGT CCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACT TAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTT CAAAAT TTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATT T TTCACCT TCATAGCATACT CCTTCC CCGT CAGGTGATACTATGAC CATGAGTAGCATCAGCCAG 
AAC ATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCAT CC CGTGTGGATAT GAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGA.GACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE a 
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5 1? 



MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFIWPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFI^FFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMZyyt^RTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKSRIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
T VNYLG IQFDVKFWSQHI S F I LVGI I IVT S IRGLLI TLTKFFYAI S S SKS SNVI VXLLAQIMGMY 
FVSSVLLIRMSMPLEYRT I ITEVLGELQFNFYHRWFDVT FLVSALSS ILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 



P. 425-444 

5 jjvj; 



N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin -binding proteins 

amino acids 151-160 
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FIGURE 5 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCT TGTGAGCAAAAAGGCGAACCAGCAGCTGAAT T TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT T TGCAGC CTAT TGT TACAAC TCATC TGATAC TTGGACTAACTCGT GCAT TCCAGAA 
ATT ATCACCACCA7^AGATC CCAT AT TCAACACTCAAACTGCAACACAAACAACAGAAT T TAT TGT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 



%J GAAACTAGCACCATGT CTACAGAAACT GAACCAT TTGTTGAAAATAAAGCAGCATTCAAGAATGA 

ffl AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
||| CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
III CAGCAGAAGGAAATGATCGAAACCAAAGTAGTA2VAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
JJ; T GAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGT CCAAGAG TCCAAGCAAAACTACCGT GC 

rfl GAT GCC TGGAAGCTGAAGT T TAGA T GAGACAGAAAT GAGGAGACACACC TGAGGCTGGT T TCT TT 

m CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 

CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
ACAGACAGGGTCAAAGTGTT TTCTCTGAACACATTGAGTT GGAATCACT GT T TAGAACACAC ACA 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC TCT T AT AAAT T T C TAT T T T TAT C T GAG T T AC AGAAAT GAT T AC T AAG G AAG AT T 
ACTCAG TAAT T T GT T T AAAAAGTAATAAAAT TCAACAAACAT T T GCT GAAT AGC TACTAT AT GTC 
AAGTGC TGTGCAAGGTATTACACTCTGTAAT TGAATAT TATTCCTCAAAAAAT TGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
T TTTAT TT T TGC TGAGACTAATCT TAT TCAT T T T CT C T AATATGGCAAC CAT TATAACC T TAAT T 
TATTAT T AAC AT AC C T AAGAAGT AC AT TGT TACC TCTATATACCAAAGCACATTTTAAAAGT GCC 
AT TAACAAATGTATCACTAGCCCTCCT TT TTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT 
TGT GAC AAAAAAT T AAAGCAT T T AGAAAAC T T 
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FIGURE 6 



1^ 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSIAGKDQVETALKASFETCSYGWVGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETS TMS TETE PFVENKAAFKNEAAG FGGVPTALLVLALL FFGAAAGLGFCYVK 
RWKAFPFTNKNQQKEMIETKWKEEKMDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 



f|| Tyrosine kinase phosphorylation site. 

amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCATCATCAGAAGT GAACC TGGCAAACT TACC TC CC AGC TATCACAATGAGACCAACACAGAC 
ACGAAGGTTGGAAATAATACCAT CCATGTGCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 
T GGACAAATGGT CT T T TCAGAGACAG T TATCACATC TGTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
S3 ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
O GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
If! ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
|s| ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
■§n CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 

m 



CCCACAGCCACAGC C T GGTGTAT G T G T GCAAGCC GACCT T CGTGGGGAGCCG TGACCAAGAT GGG 
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATT TAGA TCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
Ifl CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
fj CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
1 1 1 GGAGCAGT TTGCCACCCCTGTCCAGAT TAT TGGCTGCTT T GCC TCTACCAGT TGGCAGACAGCCG 

TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAACC TGCAGAAACAGTAC T TAGGTAATT GTAGGGC GAGGAT TATAAATGAAAT T TGC 
AAAATCAC TTAGCAGCAAC TGAAGACAAT TAT CAAC CACGTGGAGAAAATCAAACCGAGCAGGGC 
T GT GT GAAACAT GGT T GTAATAT GC GACT GCGAACAC TGAACT CTAC GCCAC TCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATT GTATAAGCATGC TT TCTT TGAGT TT TAAAT TAT G TATAAACATAAGT T GCATT TAGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWLANLPPSYHNETNTDTKVGNNTIHVHREIHKITNMQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 



O amino acids 1-19 

m 
u 



N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 

Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGGaCGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATAT TGAC CACAAGC T CAGAGGAG TGCAGAACCC TGTAGCCCGCT GCATCATGTGCTGT TT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 

sj: | 

Zl ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
||1 GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
111 ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
^ GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 

GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
Q CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
III CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 

If. TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 

ft I 

AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACT GCACTCCAACCTGGGTGACAGACT CTGT C TCCAAAACAAAACAAACAAACAAA 
AAG AT T T TAT TAAAGATAT T TT GT T AAC T C 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NOTLM.GQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNA5MLLMRN 
IVRVVVLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

It amino acids 57-80 (type II), 110-126, 215-231, 254-274 

m 

|y N-glycosylation sites. 

Wl amino acids 16-20, 27-31, 289-293 



ISBs it 



Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 



Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
L.'^ ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
III CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 

w 

*?% GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
|y TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
|f| CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTXGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
P AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
HI GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
Q ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
^ ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSS I PEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVLASLm/MMTLTNWYKPGETRKM^^ 



Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 




Signal sequence: 



FS 



amino acids 1-20 



ru 
o 
ru 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAG GATGA ACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGATGCAGTATGACTACTATTCTTCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACT GCGCCATTGGTGGGCAATAGCGT T GACAACGGCAGTGACCAG TGCCTT T T TAC TAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
C T TGCC TGGAT T GAGACGTGGT T CCTGGAT T TCAAAGT GT TAC CTCAAGAAGCAGAAGAAGAAAA 
CAGACTCC TGATAGT T CAGGATGCT T CAGAGAGGGCAGCACT TATACC T GGTGG TCT T TCTGAT G 
GTCAGT TT TATT CCCCTCC TGAATCCGAAGCAGGATCT GAAGAAGCT GAAGAAAAACAGGACAGT 
G AG AAAC CAC T T T T AGAAC T ATGAGT ACTAC T T T T G T T AAAT G T G AAAAAC C C T CAC AGAAAG T C 



IP 

i: I 

if? 

* ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
? ***' CACTGC TGGCT T TATT GAACAGCTAATAAAGAT T TAT T TAT TG TAAT ACCT CACAAACGT T GTAC 

n fi 

p| CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
Jl 3 TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
p AAAAGACTTAATATATTGAAGTAACACTTTTTTAGTAAGCAAGATACCTT TT TATTTCAAT TCAC 

fU AGAATGGAAT T T T TT TGT TT CAT GT CT CAGAT T TAT TT TGTAT TTCTTTTT TAACACTC T ACAT T 

TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T TGCAAAAGGGGAAGAAAGGAAT T GCGAATACATGTAAAATGTCAC CAGACAT T TGTATTAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
C T AAAT T AAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI IELNVNGGIENTLEKEVMQYDYYSSYFDI FLLAVFRFKVL ILAYAVCRLRHWWAIALTT 
AVT SAFLLAKVT LSKL FS QGAFGYVLP I 1 S F ILAW IE TWFLDFKVLPQEAEEENRLL I VQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myri s toyla tion si tes . 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE is 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACC CGACACGGT GGGGCT G TACG TCT T TGAGAACGGCATCACCATGC T CGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
N= CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
O GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
jp TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
IX CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
VI TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
W TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
If! TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
m CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
|V CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 

TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
J r CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
I'll ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
y'l GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
P'| GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
J! AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
*jf GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
O GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
f|| GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
TTTAGGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCT 
GGGAAGATGTTTT TCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGAT AT GAA 
GGCCT T TTGTAAGAAAAAATAAAAGAT GAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSW FGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSIiAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
f% VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
Q TVT QLRPNATY SVCVMPLGPGRVPE GE EAC GEAHT P PAVHSNHAP VT QARE GNL PLL I APALAAV 

W LLAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGP LE LE GVKVPLE PGPKATEGGGEALP SG SE 

^ CEVPLMG FPG PGLQS PLHAKP Y I 

I? a 

^ Important features: 

s Signal peptide: 

M amino acids 1-23 

In 

III Transmembrane domain: 

ill 

pi amino acids 579-599 

f) EGF-like domain cysteine pattern signature. 

Til amino acids 430-442 

Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-rnyristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTC ATGC G 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCT TAGATTCCAAGACTAC T T TGACAT CAGATGAGTCAGT A 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA 
AT T AGAATCC T CTAT TCAAGAAGAGGAAGACAGCCT CAAGAGCCAAGAGGGGGAAAGT GTCACAG 
AAGATATCAGCT T T CTAGAGTCT CCAAATCCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGT A 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
Q T TT CCTAGATAAGGAGTATGATGAATGTACATGAGAT GGGAGGGAAGATGGCAGAC TGT GGT GT G 

fjl C TACAACC TAT GACTACAAAGCAGATGAAAAGTGGGGCT T T TG TGAAAC TGAAGAAGAGGCT GCT 

AAGAGACGGCAGAT GCAGGAAGCAGAAATGATGTAT CAAAC TGGAAT GAAAATCC T TAAT GGAAG 
}5J CAATAAGAAAAGCCAZWWVGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
III CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
| S :- B GCAGCGAGAGAGAT GT TT GAGAAGC TGACTGAGGAAGGCT C TCCCAAGGGACAGAC TGCT C T TGG 

« CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
Q TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 

AAT AAT AT TAAC AT CAGAAGAATT T GT GGT T TATAGCGGCCACAACT TT T T CAGCT TT CAT GAT C 
Wl CAGAT T TGC TTGTAT T AAGACCAAATATTCAGT TGAACT T CCT TCAAAT TCT TGT TAATGGATAT 

Q AACACATGGAAT CT ACATG TAAATGAAAGT TGGTGGAGTCCACAATT T T T CTTTAAAATGAT TAG 

III TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 



GAAT TG TCAGAATC AT TT T T TACATTAGATTAT CAT AAT T T TAAAAAT TT T TCTT TAGT TT T TCA 
AAATTTTGTAAATGGTGGCTATAGAAAAACAACATGAAATATTATACAATATTTTGCAACAATGC 
CCTAAGAAT TG T TAAAAT TCATGGAGT TAT T TGTGCAGAATGACTCCAGAGAGC TC TAC T TT CT G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAAC TCATT TT TAATAAAATTATGT C TAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCET^^ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVYYTFGALGGNLIAHHVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AAT TCAGATT TT AAGC CCAT TCT GCAGTGGAAT T TCAT GAAC TAGCAAGAGGACACCAT CT T C T T 
GTAT TATACAAGAAAGGAG T GTACCTATCACACACAGGGGGAAAA ATG CTC T T T TGGGTGC TAGG 
CCTCCTAATCCT CTGT GGTT TTCTGTGGACTCGTAAAGGAAAACTAAAGAT TGAAGACATCACTG 
ATAAGTACAT T T T TAT CAC TGGATGT GACTCGGGCT TT GGAAACT TGGCAGCCAGAAC T T T T GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACC T CAGAGAGACT T CGTAC TGTGCTTCTGGATGT GACCGACCCAGAGAATGTCAAGAGGACT G 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACAC TAGAGGACTACAGAGAACCTATTGAAGT GAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TAT GCAGT GGAAGGT T TCAATGACAGCTTAAGACGGGACAT GAAAGCT T TT GGTGTGCACGT CT C 
ATGCAT TGAACCAGGAT T G T TCAAAACAAACT T GGCAGAT C CAGTAAAGGTAAT T GAAAAAAAAC 
T CGCCATT TGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 
AGTCTAGACAAACTGAAAGGCAATAAATCC TATGTGAACATGGACCT CTCT CCGGTGGTAGAGT G 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGCTGGCT AATCCCAAGGCAGT GTGACTCAGC TAACCACAAAT GT CT CC TCCAGGC TATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAA?y\AAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 

MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSSRLRTVLLDVTDPEOTKRTAQWVKNQVGEKGLWGLimAGVPGVLAPTDWLTLEDY 
RE P I EVNL FGL I SVTLNMLPLVKKAQGRVINVS S VGGRLAI VGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTN1ADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD 
LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 



m 

a B s 



Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 



Glycosaminoglycan attachment site. 

Pj amino acids 39-42 

•5* 

ru 



N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCAT TACTGAT TCCCAAATGGATGATG T TGAAGT T G TT TATACAAT TGA 
CAT TCAGAAATATAT T CCATGCTAT CAGCT TTT TAGCT T T TATAAT T CTTCAGGCGAAGT AAAT G 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGT CAT T CAGAT CAGATCATGACGT T T AGAGAGAGGC T GCT TCACAAAAAC T TGCAGGAGCAT T T 
T TCAAACCAAGACCT TGT T T TTCTGCTAT TAACACCAAG TATAAT AACAGAAAGC T GCTC TAC TC 
J= n ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
O GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
CP TTT TAGCC GAGCAGTACAAACACACAGCTC T AAAT TT TT T GAAGAAGAT GGATCC T TAAAGGAGG 

H ; f TACATAAGATAAAT GAAATGTATGCTT CATTACAAGAGGAATTAAAGAGTATATGCAAAAAAGTG 

?J GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 

U1 

m GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
* ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 

0 AT GT C TT TAAAAAATAGACATGT T T CT AAAAGTAGCTGTAACTACAACCACCATCT CGATGTAGT 

Iff 

AGACAATCTGACCTTAATGGTAGAACACACTGACAT TCCT GAAGCTAGT CCAGCTAGTACACCAC 

fl 

SI AAATCATTAAGCATAAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTA 
p| GAT ACACAAGACAAACGATCTAAAGCAAATACTGG T AGTAG TAACCAAGATAAAGCAT CCAAAAT 

flj GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGATGAAGGGTT TTGGTGAATATTCACGGT CTC 

CTACATTTTGATCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAAC 
AT T TC TAT TGT T T TTAC TATGT T GAGCTAC T TGCAGTAAGT TCAT T T GT TTT TAC TAT G T TCACC 
TGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACAT TC T T TAGAAT TGGAAAAG T GAGAC CAGGCACAGT GGCTCACAC CT GTAATCC CAGCACT 
TAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGACCATGTCTATTAAAAAATAAAAT GGAAAAGCAAGAATAGCC T TAT T TTCAAAATAT GGAAA 
G AAAT T T A T AT G AAAAT T TAT C T GAG T CAT T AAAAT T C T C C T T AAG T GAT AC T T T T T T AG AAG T A 
CAT TAT GGCTAGAGT T GCCAGATAAAATGC TGGATATCAT GCAATAAAT T T GCAAAACAT CATCT 
AAAAT TTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFS FYNS SGE VNEQALKKI LSNVKKNWGWYKFRRHS DQIMT FRERLLHKNLQEH FSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWMFLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



HI 



V 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGAT GC CCCC CT CCACGAGAT CAAC G GT GAT C ATTT GAAGATCTGT CCC CAGGGTT CTACCTGCT GCT CT 
O CAAGAGAT GGAGGAGAAGT ACAGCCT G CAAAGT AAAGAT GATTT CAAAAGT GT GGTCAGCGAACAGT GCAAT CATTTG 

ffc-J CAAG CTGT CT TT GCTT CACGTTACAAGAAGT TT GAT GAAT T CTT CAAAGAACTACTT GAAAATGCAGAGAAATCCCTG 

AATGATAT GTTT GT GAAGACATATGGCCATT TAT ACATGCAAAATT CT GAGCTATTTAAAGATCT CTT CGT AGAGT TG 
AAAC GTTACTACGT GGTGGGAAAT GT GAACCTGGAAGAAAT GCTAAAT GACTT CT GGGCTC GCCTCCT GGAGCGGATG 
TTCC GCCT GGT GAACT CC CAGTACCACTTTACAGAT GAGTATCT GGAATGT GT GAGCAAGTATACGGAGCAGCTGAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTT GCGGGAGATGTCGTGAGCAAGGTCT CCGTGGTAAACCCCACAGCCCAGT GTACCCATGCCCTGTTGAAG 
* AT GAT CTACTGCTC CCACT GCCGGGGTCT CGTGACT GTGAAGC CAT GTTACAACT ACT GCT CAAACAT CAT GAGAGGC 

I J TGTTTGGCC^CCAAGGGGATCTCGATTTTGAATGGAAC^TTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 
IF! GAGGGTCC TTT CAACATT GAATCGGT CAT GGAT CCCATCGATGT GAAGATTTCT GAT GCTAT TATGAACAT GCAGGAT 

%nl AATAGTGT T CAAGT GT CT CAGAAGGTT TT CCAGG GATGT GGAC C CCC CAAGCC C CT C C CAGCT GGACGAAT TT CTCGT 

I II T CCAT CTCT GAAAGT GCCT TCAGT GCTC GCT TCAGACCACATCACC CCGAGGAACGC C CAACCACAGCAGCT GGCACT 

1*1 AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
III GTTT GCAACGAT GAGAGGAT GGCTGCAGGAAAC GGCAATGAGGATGACT GT TGGAAT GGGAAAGGCAAAAGCAGGTAC 

CTGTT T GCAGT GACAGGAAATGGATT AGCCAAC CAGGGCAACAACCCAGAGGTC CAGGTTGACACCAGCAAACCAGAC 
ATACT GAT CCTT CGTCAAATCAT GGCTCT T CGAGTGATGACCAGCAAGAT GAAGAAT GCAT ACAAT GGGAACGACGTG 
GACTT CTTT GATATCAGT GATGAAAGTAGTGGAGAAGGAAGTGGAAGT GGCTGT GAGTATCAGCAG TGCCCTTCAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
QRGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAG ATAAT TCTCftAACTCTGAG 
AAAAAGTGT TGATCAAA?^AGTTAAAAGGCACCAGTT ATCACTTTTCTAC CATCCTAGT GACTT T GCTT TTTAAAT GAA 
T GGACAACAATGTACAGTT T TTACTATGT GGCCACT GGTTTAAGAAGTGCTGACTTTGTTTTCT CAT T CAGT TTT GGG 
AGGAAAAGGGACTGTGCATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CTGAAAT ATTAAAT AGCT GTACAGAAG CAGGTT TT AT TTATCAT GTTAT CTTATTAAAAGAAAAAGC CCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLS AALLAAELKSKS C S EVRRL YVSKGFNKNDAPLHE I NGDHLKI CPQGS T 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNWLEEMLNDFWARLLERMFRLWSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAART FAQGLAVAGDVVSKVS VVNPTAQCTHALLKMI YCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLT VFC I LFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



4* 



til 



III 



C TC GCCCTCAAATGGGAAC GC TGGC CT GGGAC TAAAGCATAGACCAC CAGGC T GAGTATCCT GAC 
C TGAGTCATCCCCAGGGATCAGGAGCC TCCAGCAGGGAAC C TTCCAT TATAT TC T TCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCAT T CCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCT TCCAAT TAAACAT TCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCATTT AACCT TAAATGCAATCAGGAAAGTAGCAAACAGAAGT CAATAAATAT TTTT 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites ♦ 

amino acids 27-33, 46-52 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
jw* GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 

m CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 

■as.? 

CH T C TT T T TC TGGGAGCATGTGGCAGAAC CATATGGAAGCTGGGC CT TCATGT GGCAGCAAGT T T T C 

GAGCCCACCTGGAAACACAT TGGGGAT GGC TGCTGC C T CAC CAGAGAGACC T GGAAGGAT CT TGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
| w y GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
3 TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
13 GCAGAAT GAGAGAAGACAT T CAT GTACCACCTAC TAG TCCC TC TC TCCCCAACC T CT GCCAGGGC 

AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
Q CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
ffj CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 

CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACT CAAAGAGGCAGAGGT T T TGT TC TCAAATAT T T TT TAATAAATAGACGAA 
ACCACG 



If! 

m 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDWVCTLVLCSVQSPRKVLQETORVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLE QATHQP I YLPLRGT 



Important features: 
PJ Signal peptide: 

t*% amino acids 1-23 



Ul Leucine zipper pattern. 

amino acids 10-32 



N-rnyristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 



in 
o 

If! =i 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAG TGGGTGGGC TAGGGGGGCTGCC T TAT T T AAA 
GTGGT TGTT TAT GAT TCT TATACTAAT T TATACAAAGATAT TAAGGC CCTGT T CAT TAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 



o 
f II 

ni 



MIjLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 31 



G T T T GAAT T CC T T C AAC TAT ACC CACAG T C CAAAAG CAG AC T CAC T GT G T C CCAGGC T AC CAG T T 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATC TGT T T TGATAAATGATG T TGACACCCT CCACC GAAT T C TAAGTGGAATCATGT CGG 
GAAGAGATACAATCCT TGGCC TGTGTATCC TC GCAT TAGCC TT GT CT T T GGCCATGATG T T TACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
C TGCTCGTC TT GAT T T T T GT TCT CAGAAAGAGAATAAAATT GACAGT TGAGC TT T TCCAAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AAT TGGCC T CAT CT GGAC TAGTGAAT T CATCC T TGC GTGC CAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT T CCGAGAAT CAT T GTCATGT ACAT GCAAAACGCACTGAAAGAACAGCAGCAT GGT GCATTGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCATATACTACAAC TGCTAT TAATGGGACAGATTTC TGTACATCAGCAAAAGATGC 
AT TCAAAAT C TT GT CCAAGAACT CAAGTCACT T TACAT CTAT TAACT GCT T TGGAGAC T TCATAA 

fit TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 

s 

AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAATGAT GGATCGTCAGAAAAGCCC TACT TT AT GGATCAAGAAT T TC TGAG TT T C GTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAACTCCAGGCCATTGTGAG ATAGA TACCCATTTAGGTATCTGTACCTGGAAAACATT 
TCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTTAGTGAATTTTTTT 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 



us 

m 



in 
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FIGURE 32 



MSGRDTILGLCILALxALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
I L I FFWL WAVLLS LGTAGAAQVMEGGQVE YKPLS G I RYMWS YHL I GLIWTSEFI LACQQMT I A 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWC FTVFGGLMAFNYNRAFQWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 



Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 




amino acids 1-20 



Signal peptide: 



Important features: 



o 



N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 



N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCT CAC TAT GAAGGCAT C TGT TAT TGAAAT GTTC C TTGTT T TGCTGGTGACT GGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GAT G TCAAAGCC GGAAAGATCAT CGAT CC TGAGT TCAT TGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGT TAT TCCAACGGT GTCCAATCGT TAT CCC TACCACGAT GGAGAGAATCCTT 
TATCGTCTTAGAAAG TAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCAT CAT 
H CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
f»l ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
W CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 

GCCACAGGAGCCAGGAGATGGAT C TCT GGTCCAC TGCC AC CTACACAAGCAGCCAAAACAGGCCC 
CM AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
III GGATGTCAGCCT GGGACT T GTT CCAAAAGAAGAATT GAGCACACAGTCTT T GGAGCCAGT AT CCC 

|f| TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 

CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
CGGTCCAC TGAT GGG T GT T GTCCAGTATGGAGACAACC CT GCTACTCAC T T TAACC TCAAGACAC 
ACACGAAT TCTC GAGATCTGAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACT T TCT AAT 
n GTAGGTCGGGCCAT C TCCT T TGT GACC AAGAACT TCT TTT C CAAAGCCAAT GGAAACAGAAGCGG 

f J GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
|p GACT TGCGAGAGAGT CAGGAATCAACATT T TC TTCATC AC CAT T GAAGGT GCTGC TGAAAAT GAG 

pi AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
Iff GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
fU ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GT T TGACAACC T CCAT CAG TATGTCCCCAGGATCATCCAGAACAT T T G T ACAGAGT TCAACT CAC 
AGCCTCGGAAC TGAA T TCAGAGCAGGCAGAGCACCAGCAAG TGC T GC T TTACTAAC TGACGT GTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCT T GT T AT TAT T C T TTGCCAT CATGC T T T T T CATAT TCCAAAACTT GGAGTTACAAAGA 
TGATCACAAACGTATAGAATGAGCCAAAAGGC TACAT CAT G TT GAGGG T GCT GGAGAT T T TACAT 
T T TGACAAT TGT TT TCAAAATAAAT G T TCGGAATACAGTGCAGCCC T TACGACAGGCT TACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T T G TCAT G AC AAT G TAGGAAT T G C T GAAT TAAAT GTT T AGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
E S FI VLE S KPKKGVT YP SALT YS S S KS PAAQAGE TTKAYQRPP I PGT TAQPVTLMQLLAVT VAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAI EKI TQRGGLSNVGRAIS FVTKNFFSKANGN 
RSGAPNVVVVMVDGWPTDKVEEASRLARES GINI FFI T I E GAAENEKQYWEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRIiACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI LI T DGRS YDDVRI PAMAAHLKGVI TYAI GVAWAAQEELEVI ATHPARDHS FF 
VDEFDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 



App_ID=10063551 



Page 184 



FIGURE 35 



W 

in 



fU 

0 

fii 



CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
TGGT GACCTCCC ACCCTT C AGATGTGAAAGCC AGGC AGGCCATT AG AGTT ACTT GGGGTGAAAAAAAGTCT 
TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 
AT TGTC CTT AGAGGAT GAAC ACCT TCT T T ATGGTGACATAATCCGACAAGAT TT TTTAGACACAT AT AAT A 
ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
AAGACAGACACTGATGTT TTCATCAATACTGGCAAT TTAGTGAAGT ATCT TTT AAACCTAAACCACTCAGA 
GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TT GGTGCC AAGG ATCTAT GAAATGATGGGT C ACGT AAAAC CC ATCAAGT T TGAAGAT GT T TATGTCGG GAT 
CTGT TT GAATTT ATTAAAAGTG AACAT T CATATTCC AGAAGAC ACAAAT CTT TTCT T TCT AT AT AG AAT CC 
ATTT GGATGTCTGTCAACTGAG AC GT GT GATTGCAGCC CATGGCTTT T CT TCCAAGGAGATC ATCACT TTT 
TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 
GAT ACCT TGTGGAAAGTGTTAAATAAAGTAGGTACT GT GGAAAATTCAT GGGGAGGTC AGTGT GCTGGCT T 
ACACTGAACTGAAACT C ATGAAAAACCC AG ACTGGAGACT GGAGGGTT AC ACTTGTGATT TAT T AGTC AGG 
CC CT TC AAAGAT GAT ATGTGGAGGAAT T AAAT AT AAAGGAAT TGGAGGT T TT TGCT AAAGAAATT AAT AGG 
AC CAAAC AATTTGGAC AT GTCATT CT GT AGAC TAGAAT TTCT T AAAAGGGTGTT ACTG AGTT AT AAGCT CA 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATT TATT GAAC AAT GTAGT CACTTGAAGGTTTTGTGTA 
T AT CTT ATGTGGAT T ACCAATT T AAAAAT ATATGTAGTTCT GTGTCAAAAAACTTCT TCACT GAAGTT AT A 
CT GAAC AAAATTTT AC CT GTTT TT GGTC AT T T ATAAAGTAC T TC AAGATGT T GC AGTAT T TC AC AGTT ATT 
ATTATT T AAAATTACTTCAACTTTGTGT TTTT AAAT GTTTTGACGATTTCAATACAAGATAAAAAGGATAG 
T GAATC AT TCTT TACATGCAAACATT TT CC AGTTAC TT AACT GATC AGT T TAT T ATTG AT ACATC ACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERWWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFSDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 



p ?5 Important features: 

f*| Type II transmembrane domain: 

Cfl amino acids 20-39 

m 



N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 



p| Ly-6 / u-PAR domain proteins. 



amino acids 23-37 



N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCARGGATGGAGCTGGGTTGCTGGACGCAG1TGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATATGATCAGATTGAGTGCGTC TGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCC C TTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACT CCTGC C TGAT CCAC CCAGGT TG TACCAT C TT TGAAAAC TGC AAGAGC TGCCGAAATGGCTCATGGGGGGGT 
ACCT TGGATGAC TTC T ATGTGAAGGGGTT C TAC TGTGCAGAGTGC CGAGCAGGC TGGTAC GGAGGAGACTGCATGCGAT GT G 
GCC^GGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCC^ 

TAAACCTGGGT T TGTC ATCCAAC TAAGAT TTGTCATGTTGAGT C TGGAGTT TGACTACATGTGCCAGTATGACTATGTTGAG 
GTTCGTGATGGAGAC AACCGCGAT GGCCAGATCAT CAAGCGTGT C TGTGGC AACGAGCGGCCAGCTCCTATCCAGAGCATAG 
GATCCTGAC TCCACGTCCTC T TCC ACTCCGATGGC TCCAAGAATTTTGACGGTTTCC ATGCCATTTATGAG GAGATCACAGC 
p| ATGCTCC T CATCCCC T TGT T TCCATGACGGCACGTGCGTCCTTGACAAGGC TGGATC TTACAAGTGTGCCT GC TT GGCAGGC 

|*| TATACTGGGCAGCGC TGTGAAAATCTCC T TGAAGAAAGAAACTGC TC AGAC CCTGGGGGCCCAGTCAATGGGTAC CAGAAAA 

||% TAACAGGGGGC C CTGGGCT TATCAACGGACGCCAT GCTAAAAT TGGC AC CGT GGTGTC T TTCT T T TGTAACAACT CCTATGT 

g 9 TCTTAGTGGCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCA 

| sa GAACCAAAGATT TCAGACC T GGT GAGAAGGAGAGTTCTT CC GATGCAGGTT C AGTC AAGGGAGACACCATTACAC CAGC TAT 

Hi ACTCAGCGGCCT TCAGC AAGC AGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCC C TTCC CTTTGGAGAT CT GCC CATGGG 

7 * ATACCAACATC TGCATACCC AGC TCCAGTATGAGTGCATCTCACCCT TC TACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 

r CTGAGGACTGGGAAGTGGAGTGGGCGGGC ACCATC C TGCAT CC C TAT C TGCGGGAAAATTGAGAACATCACTGCTCCAAAGA 
* CCCAAGGGT TGCGCTGGCCGTGGC AGGCAGCCATC TACAGGAGGACC AGCGGGGTGC ATGACGGCAGCCTACACAAGGGAGC 

W GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 

in 

GTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGGGATGATGACCGGGATGAGAAGACCATCC 
^ AGAGCCTACAGAT T TC T GC TATCATTC TGCATCCCAACTATGACCCC AT CC TGC T TGATGC TGACATCGCCATCC T GAAGCT 

I y CCTAGACAAGGCC CGT ATCAGCAC CCGAGTCCAGC CCATC TGC CT CGC T GCC AGTCGGGATC TCAGCAC TTCCTTC CAGGAG 

1*1 TCCCACATCAC TGTGGC TGGC TGGAATGTCCTGGCAGACG T GAGGAGCC C T GGC T TCAAGAACGACACACTGC GCTC TGGGG 

jf|| TGGTCAGTGTGGTGGACTCGC TGC TGTGT GAGGAGCAGCAT GAGGAC C AT GGC ATC C CAGTGAGTGTCACTGATAACATGT T 

CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATAAAACATGCAGCCACAGGCTCTCCA 
CTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATTGAAAGAAATATGAA ATG&A CCATGCTCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
CTACTAGGACAGC CAAT TGGAAGATGCC AGGGC TT GCAAGAAGTAAGT T TCT TC AAAGAAGACCATATACAAAAC C TCTCCA 
CTCCACT GACC T GGT GGTC T TCCC CAACT TTCAGT TATACGAATGCC ATCAGC TTGACCAGGGAAGATC TGGGC TTCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 



m 



MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
I PCCRNEENECDSCLIHPGCT I FENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAPIQS I GSSLHVLFHSDGSKNFDGFHAI YEE I TACSS SPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRT CQQNGEWS GKQP I C IKACREPKI SDLVRRRVLPMQVQSRE T PLHQLYS AAFSKQKLQS APTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALWERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFICDWI 
ERNMK 

s Important features of the protein: 

Q Signal peptide: 

III 

*m amino acids 1-23 

J|! EGF-like domain cysteine pattern signature. 

amino acids 260-272 
111 N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTC CTACATC CTCT CAT CTGAGAATCAGAGAGCATAAT CTT CT TACGGGCCC GT GATTT ATT AACGT GGCT TAATC 
T GAAGGTT CTCAGTCAAATT CT TTGT GATCTACT GATT GT GGGGGCAT GGCAAGGTTT GCTTAAAGGAGCT T GGCT GG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAG AATGA AGGCGCTTCTGTTGC 
T GGTCTTGCCTT GGCT CAGT CCT GCT AACTACAT TGACAAT GT GG GCAACCTGCACTT CCT GTAT T CAGAACT CT GTA 
AAG GT GCCT CC CACTACGGCCT GACCAAAGATAGGAAGAG GCGCT CACAAGAT GGCTGT CCAGAC GGCT GT GCGAGCC 
T CACAGC CACGGCT CCCTCCCCAGAGGTTT CTGCAGCTGC CACCATCT CCTTAAT GACAGACGAGCCT GGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGC^GAGGACGGGCAGCCAGCAATC^GCCCAGTGGACTCTGGCCGGAGC^CCGAACTA 
GGGCACGGCCCTTT GAGAGATCCACTATTAGAAGCAGATCATT TAAAAAAATAAAT CGAGCTTT GAGT GTT CT TC GAA 

GGACAAAGAGCGGGAGTGCAGTT GCCAAC CATGC CGACCAGGGCAGGGAAAATT CT GAAAACAC CACT GCC CCTGAAG 

PI 

*sl TCTTTCCAAGGTTGTACC^CCTGATTCCAGATGGTGAAATTACCAGC^TC^GATCAATCGAGTAGATCCCAGTGAAA 

W GCCTCT CT AT TAGGCT GGT GGGAGGTAGC GAAACCC CACT GGT CCATATCATTAT CCAACACAT T TAT CGT GATGGGG 

5 T GAT C GCCAGAGACGGCCGGCTACT GCCAGGAGACATCAT T CTAAAGGTCAACGGGAT GGACAT CAGCAAT GTCCCTC 

U* ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 

M i GCAGGAACAATGGACAGGCCCCGGAT GC CTACAGACCCC GAGAT GACAGCT TTCAT GT GATT CT CAACAAAAGTAGCC 

t S l CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 

s . 

GT GTGGCAT ATCGACATGGTCAGC TT GAGGAGAAT GACC GT GT GTTAGC CATCAATGGACATGAT CTT CGATATG GCA 
s GCCCAGAAAGTGCGGCTCATCTGATT CAGGCCAGTGAAAGACGT GTT CACCTCGT CGTGTCCCGCCAGGTTCGGCAGC 

GGAG CCCT GACAT CTTTCAG GAAGCC GGCT GGAACAGCAAT GGCAGCT G GT CC CCAGGGCCAGGGGAGAGGAGCAACA 

m 

%M CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 

n 

GCAT GACC GTC GCAGGGGGAGCAT CACATAGAGAAT GGGATTT GCCT AT CT AT GT CAT CAGTGTTGAGCCC GGAGGAG 

flf 

T CATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTT GTT GAAT GT GGAT GGGGT CGAACT GACAGAGGTCAGCC 
W GGAGT GAGGCAGTGGCATTATT GAAAAGAACAT CAT CCTCGAT AGTACT CAAAGCTTT GGAAGT CAAAGAGTAT GAGC 

C CCAGGAAGACT GCAGCAGCCC AGCAGCC CT GGACT CCAAC CACAACAT GGCC CCACCCAGTGACT GGT CCC CAT CCT 
GGG TCATGT GGCTGGAATTACCACGGTGCTT GT ATAACTGT AAAGATATT GTAT T AC GAAGAAACACAGCT GGAAGTC 
T GG GCTTCT GCAT TGT AGGAGGTTAT GAAGAAT ACAAT GGAAACAAAC CT TTTT TCAT CAAAT C CATT GT T GAAGGAA 
CACCAGCAT ACAATGATGGAAGAATT AGAT GTGGTGATAT TCTTCTTGCT GTCAATGGTAGAAGTACAT CAGGAAT GA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TTT TATAGAAT CAAT GAT GGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACTAT ATTTATC 
TT GTCAGT T TTTATATTTAAAGAAAGAAT ACATT GTAAAAATGT CAGGAAAAGT ATGATCAT CT AATGAAAGCCAGTT 
ACACCTCAGAAAATAT GAT TCCAAAAAAATTAAAACTACTAGT TT TTTT T CAGT GTGGAGGATT T CTCATT ACT CT AC 
AACATTGT TTAT ATT TTTT CTAT TCAATAAAAAGCC CTAAAAC AACT AAAATGAT TGAT TT GTATACCC CACT GAATT 
CAAGCTGATTTAAAT TTAAAATT TGGTAT AT GCT GAAGTCT GCCAAGGGT ACAT TAT GGCCAT TTT TAATT TACAGCT 
AAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTT CATCAAACAAGAATAAATATTTTT CAGAAGTTAAA 



lie 
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FIGURE 4Q 



to 

13 



III 



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISWPHNYAVRLLRQPCQVLWLTVM 
REQKFRS RNNGQAPDAYRPRDDS FHVI LNKS SPEEQLG IKLVRKVDE PGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLOTTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLWDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 



III N-glycosylation sites. 

Q amino acids 108-112, 157-161, 289-293, 384-388 

III 

o 

3J|5 Tyrosine kinase phosphorylation sites. 

5*1 

amino acids 433-441, 492-500 



N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE ax 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTT GCCT GCAGTGAAGCAGAGAGATAGAT AT TATTCACGTAATAAAAAAC ATG GGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CAC CAGTAACTACTTCGTGGGTGCCATTCAAGAGATT CCTAAAGCAAAGGAGT TCAT GGC TAATTTCC 
ATAAGAC CCTCATTTT GGGGAAGGGAAAAACTCT GAC TAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACT CTTGAATGT GGGC TAT CTAGAAGCCCT CAAGGAAGAAAATTGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
Iff CAT C C CAAGCAT CTGGTGGT T GG CAGGAACAGC ACT G GGTACAG GT T AC GT T ACAGT GGATAT T T TGG 

(M GGG T GT T ACTGCC CT AAGC AGAGAGCAGT T T T T CAAGGT GAAT G GAT TC T C T AACAACTACTGGGGAT 

I s * GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
* . C CT GAAGT GGGT AAAT ATAC AAT G GT C T T CCAC ACT AGAGACAAAGGCAAT GAGGT GAAC GCAGAACG 

jlj GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
f*% TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 



CI 
O 
ill 



TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAG CACACATT AAGAAC CT GT TAC AGCT CATT GTT GAGC T GAATT T T T CCT T T TT GT AT T T TC T 



sy TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTT GTAACAAGACAGCT TTCTTAGTCATT T TGAT 

CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AAT GAACGCTATT TGAGGACTCTGGTTGAAGGAGATT TATTTAAAT T TGAAGTAATATAT TAT GGGAT 
AAAAGGC CACAGGAAATAAGACT GCTGAATGTC TGAGAGAACCAGAGTTGT T CTC GT CCAAGG T AGAA 
AGGTACGAAGATACAATACT GTTATTCAT TTAT CCT GTACAAT CATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGT GAACTTGGGAAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTT CAGG GGAG GACC T GCC CAGGT ATG CC TT C CAGT GAT G CCCACCAGAGAATACAT TC T CTAT TAG T 
TTTTAAAGAGT TTTTGTAAAATGATTTTGTACAAGTAGGATATGAAT TAGCAGTTTACAAGTTTACAT 
AT T AAC TAATAAT AAAT AT GT CTAT CAAATAC CTC T GT AGT AAAAT GTGAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYL LEHLHP FLQRQQL DYG I YV I HQAEGKK FNRAKLLNVGYLEALKEENWDCFI FHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINI TVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites, 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 

GCT CAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCAC TGAGCTCCCAGAT C T GGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 
GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 
ATCGATCAAAGTGTGGGATGTGCTGCAAGACG TAGA ACCTACCTGCCCTGCCCCCGTCCCCTCCC 
TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 
TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA&AAAAAAZ^^ 
»i AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



ci 
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FIGURE ±± 



MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 




N-myristoylation site. 



amino acids 58-59 



amino acids 44-50 



Prokaryotic membrane lipoprotein lipid attachment site. 




amino acids 1-12 
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FIGURE 45 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGG? 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAT^AGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGT GT GACCAATCTGACATGCT GCATGGAACAT GGGGAAGAGGATGT GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
C TGGAGATGGGGAGAAAGTGATATGACCT TCAT CTGCGT T GCCAGGAACCC TGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTT TCTGAAGAGAGAGAGACAAGAAGAG TACAT TGAAGAGAAGAAGAGAGT GGACAT T TGTC GGG 
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAA.TCCCTCACACTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATAC GG T TTACT CCAC TGT GGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
E GGT 1 1 VTQNRNRERVDFPDGG YS LKLSKLKKNDS GIYYVGIYS S SLQQPS TQE YVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAMTESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAM)DPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRErPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGG TCT GAT GGCCATTCCAGCAACAACAAT GT CCTTGACAGCAAGAAAAAGAGCGT G 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT T G TGAAT T TT CAT TGAAAAACATCAG T GACATTCAT CCAGAATCCT T CAA 
CTTGCAGTGGT T TT TCAATGACTCT TGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCACTTCGATTCTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T GTAGTT TAATGGGAATAAAATGTAAGTAT CAGTAGT T TGAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTS CNGFS LLVLLLLGWLNAI PLI VSLVEEDQFSQNP ISC FEWWFPG 1 1 GAGLMAI PA 
TTMS LTARKRACCNNRTGMFLS SFFS VI TVIGALYCML I S I QALLKGPLMCNS PSMSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFWPTSNDTMASGWRASSFHFDSEENKHRLIHFSV 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-34, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAG TGAGGAAC CTGCAGAGT CACACAGT TGC TGACCAAT TGAGCTGT GAGCC T GGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGC TGAG CACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLLAGLTALEANOT 
S QKQHS PVPSKAI PLIT PGSAT T C 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 



GTGGACTCTGAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGCT GCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCC CT TGGCCAAGGGAC CAGAGAAGCAG T TGGCACT GGAGTCAGGCAGGT TCCAGGCT TT GGCGC 
AGC AGAT GCT T TGGGC AACAGGGT C GGGGAAGCAGC CC ATGCT CTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
m GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
||| CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
111 GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGGAATGGA 
PI GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
H ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
HI AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
13 T GGCGGCAGCAGTGGTGGCAGCAGT GGCAACAGTGGT GGCAGCAGAGG TGACAGCGGCAGT GAGT 

*]* CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
C T T C AGAG GACAGG G AG T T T C C AG C AAC AT GAGG G AAAT AAGCAAAG AGG G C AAT CGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T TAAATCCAAGCTGGGT TTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCT CT C 
GCATCCCG TGACCT CCAGACAAGGAGC CAC CAGATT GGAT GGGAGCC CCCACACTCCC TCC T TAA 
AACACCACCCTC TCAT CAC TAAT CT CAGCCCT T GCC C T TGAAATAAACCT T AGC TGCC CCACAAA 

AAAAAMAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



n i 



App_ID=10063551 



Page 201 of 3 18 



FIGURE 52 



MKFQGPLACLLIJ^CLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQWGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
S SGS SSGSSSGGS SGGS SGGS SGNS GG SRGDS GSE S SWGS S TGS S SGNHGGS GGGNGEKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTWSETSPGM 

FNFDT FWKNFKS KLGFINWDAINKDQRS SRI P 



Signal peptide: 

amino acids 1-21 



N-glycosylation site. 

amino acids 2 65-269 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241 f 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 5 3 
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GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGG ATG TCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
CC T CAGAGGGCAGCAGTCGTC TGGACATGTT TGAGCACAT CAGCC T CAT GACCT TGGACAG T CT A 
CAGAAATGCAT CTT CAGCT TTGACAGCCATTGT CAGGAGAGGC CCAGT GAATATATTGCCACCAT 
CT T GGAGC TCAGTGCCC T TGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGAC TT TC TGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGAC7UI 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
CAT TGT CAGAT GAG G AT ATAAGAGC AG AGG C T GACAC CTT CAT GT T T G GAG G CC AT GAC AC C AC G 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAAC TGCATCGGGCAGGCGT TCGCCATGGCGGAGATGAAAGTGG TCCTGGCGT T GAT 
GCTGC TGCACT T CC GGT T CCTGCCAGACCACAC T GAGCCCCGCAGGAAGC T GGAATT GAT CATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 

MSLLSLPWLGLRPViyYiSPWLLLLLWGSW 

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 

WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSMIMLDKWQHIASEGS 

SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 

RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 

MFGGHDTTASGLSWLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 

LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 

IAFIPFSAGPRNCIGQAFAMAEMKVVLALMLLHFRFLPDHTEPRRKLELIMRAE 

VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature . 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT TCATGGCCAGT T T TATGAAGCT T TGGAAGGCACTATGGACAGAAGCTGGTGGACAGT T T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
3 . GTGAT TCGAACAT T TGAGGGTTACT T T TGGAAGCAACAAT ACAT TCT CGAACCTGAATGTCAG TA 

S»sss 

Q GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
CI T CT GGATG T TG T CC CACT GAAT T CC CATGAATACAAACC TATTCAGCAACAGCAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

III 

111 

m 

is. 

n 

S i?3 

5 
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FIGURE 56 

MGPVKQLKRMFEPTRLI AT IMVLLCFALTLCSAFWWHNKGLAL I FCI LQSLALTWYS LSFIPFAR 
DAVKKCFAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 
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FIGURE 57 

C GGCT CGAGCTCGAGC CGAATCGGCT CGAGGGGCAGTGGAGCACCCAGGAGGCC GCCAACATGCT CTGT CT GTGCCTG 
T ACGT GCC GGTCATC GGGGAAGC CCAGAC CGAGTT CCAGT ACTTTGAGTC GAAGGGGCTCCCT GCCGAGCT GAAGT CC 
ATTTT CAAGCTCAGT GTCTT CAT CCCCTCCCAGGAATTCT CCACCTACCGCCAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGACCTT GAT GGGCAGCT AGACT TT GAAGAAT TTGT CCAT TAT CT CCAAGATCATGAGAAGAAGCTGAGG 
CT GGT GTT T AAGATT TTG GACAAAAAGAAT GAT GGACG CATTGACGCG CAGGAGAT CAT GCAGT CCCT GCG GGACT TG 
GGAGT CAAGATATCT GAAC AGCAGGCAGAAAAAATT CTCAAGAGCAT GGATAAAAACGGCACGAT GACCAT CGACTGG 
AACGAGTGGAGAGACTACCACCTCCTCC^CCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 
AT CTTT GAT GT GGGT GAGAAT CT AACGGT CCCGGAT GAGTT CACAGT GGAGGAGAGGCAGACGGGGAT GT GGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT GCCT CCCGCAGCAACAACAT GGGCATCGTTGGTG GCTTCACT CAGATGAT TCGAGAAGGAGGGGCCAGG 
T CACT CTG GCGGGGCAAT GGCAT CAACGT CCTCAAAATT GC CC CCGAATCAGCCATC AAAT T CAT GGC CTAT GAGCAG 
AT CAAGCGC CTTGTT GGTAGTGACCAGGAGACT CT GAGGAT TCACGAGAGGCTT GTGGCAG GGT CCTT GGCAGGGGCC 
?** ATCGCCCAGAGCAGCATCTACCCT^TGGAGGTCCTGAAGACCCGGATGGCGCTGCGGAAGACAGGCCAGTACTCAGGA 
H AT GCT GGACTGCGCCAGGAGGAT CCT GGCCAGAGAGGG GGT GGCCGCCTT CTACAAAGGCTATGTCCCCAACATGCT G 

111 GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
M AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
ff| CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
T s TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 

G CTGTGAG CATCAGCTACGTGGT CTACGAGAACCTGAAGAT CACCCT GGGCGTGCAGT CGCGG TGACGGGGGGAGGG C 
If! CGCCCGGC^GTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
S«* AAGCTGTCT CGAGCCAAGCTGT GAAAAC CCTAGACGCAC CCGCAGGGAGGGTGGGGAGAGCT GGCAGG C CCAGGGCT T 

^* GT C CT GCT GAC CCCAGCAGACCCTCCTGT TG GTT CCAGC GAAGACCAC AGGCAT T CCT TAGGGT CCAG GGT CAGCAGG 

Ms CT CCGGGCT CACATGT GTAAGGACAGGACAT TT T CT GCAGT GCCTGC CAATAGT GAGCTTGGAGCCTGGAGGCCGGCT 

TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
* CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
CI ATAATCCAT GATGAAAGGTGAGGTCACGT GGCCTCCCAGGCCTGACTTCCCAACCTACAGCATT GACGCCAACTTGGC 

»ll TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
W.S CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
CI GGCCTGGAC CCT GTCAGGATGGGC CCCAC CT CAGAACCAAACT CACT GTCC CCACTGTG GC ATGAGGGCAGT GGAGCA 

S\ CCATGTTT GAGGGCGAAGGGCAGAGCGTT TGT GT GT T CTGGGGAGGGAAGGAAAAGGT GTT GGAGGCCT TAAT TAT GG 

^1 ACT GTT GGGAAAAGGGT TTTGT C CAGAAGGACAAGCCGGACAAATGAGCGACTT CTGTGCTT CCAGAGGAAGACGAGG 

Q GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
f 3 \ GGGACCAGCCC CACATT CCACTT GTGTCACT GCTTGGAAC CTATTTAT TTT GTATTT ATTT GAACAGAGTT ATGT C CT 

AACTATTT TTAT AGATTT GTTT AATT AATAGCTT GT CATT TT CAAGT T CATTTT TTATTCATAT TTAT GTT C ATGGT T 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCC^^ 

GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTT GAAGGT GGAAT CCAGT TATT T CCT GCGCTGC GAGGGTTTCT TTATT TCACT CTTT T CT GAAT GTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CTT CAATAGGATGCAAAGAT CAATGCAAAAAT TGTTAT ATAT GAACATATAACTGGAGT CGT CAAAAAG 
CAAAT TAAGAAAGAAT TGGACGTTAGAAGTTGT CAT T TAAAGCAGCCTTCTAAT AAAGT TGTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



App_ID=10063551 



Page 207 of 3 18 



FIGURE *8 

MLCLCLYVPVIGSAQTSFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHP VEN I PE 1 1 LYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVmQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYE Q I KRL VG S DQE T L R I HE RL VAGS LAGAI AQS S I Y PME VL KT RMALRKT GQ Y S GML DCARR 
IIiAREGVAAFYKGYVPNMLGI I PYAGIDLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLAS YPLALVRTRMQAQAS IEGAPEVTMS SLFKRT LRTEGAFGLYRGLAPNFMKVI PAVS I SYV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 

GGA^GCAGCGGC^GCTCCACTCAGCCAGTACCC^GATACGCTGGGAACCTTCCCCAGCCATOGC 
T TCCCTGGGGCAGATCC TC T TC T GGAGCATAAT TAGCATCATCAT TAT T C T GGC TGGAGCAAT TG 
CACTCAT CATT GGCT T TGG TAT T TCAGGGAGACACT CCATCACAGTCAC TACTG T CGCCTCAGCT 
GGGAACAT TGGGGAGGAT GGAATCCT GAGC TGCACT T TTGAACCTGACAT CAAACT T TCTGATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
T CAACAACACATACTCC TG TAT GATTGAAAATGACAT T GCCAAAGCAACAGGGGATAT CAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGT CACCTACAGCT GCTAAAC TCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCCT TGGCCACAAAAAAGCAT GCAAAGTCAT TG T TACAACAGGGATC TACAGAACTAT T T CAC 
CACCAGATATGACCTAGTTT TATATT T CTGGGAGGAAATGAATTCATAT CTAGAAGTCTGGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
C TATC T TCAAAGACAT ATTAGAAG TTGGGAAAATAAT T CAT GTGAACTAGACAAGT GTGT TAAGA 
GTGATAAG TAAAATGCACG T GGAGACAAGTGCATCCCCAGATCT CAGGGACCTCCCCC T GCCT GT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
T GTAATGT TGCT CT GAGGAAGCCCCT GGAAAG T C TATC CCAACATAT CCACATCT TATAT TC CAC 
AAAT TAAGCT GT AGTATGTAC CC TAAGACGCTGC TAAT TGAC TGCCAC TTC GCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T CT TCC CAAC TGACAAATGCCAAAGT T GAGAAAAAT GATCATAAT T T TAGCATAAACAGAGCAGT 
C GGGGACACCGAT TT TATAAATAAAC T GAGCACC T T CT TT T TAAACAAAAAA7VAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 

MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVT 1^ 
CVSS FFAI S WALLPLSPYLMLK 



Important features: 
Signal peptide: 



amino acids 1-26 



III Transmembrane domain: 

11! amino acids 258-281 



1,1 

rii 



N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 



| ; J N-myristoylation sites. 

^ amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



T GACGTCAGAAT CACC ATG GCCAGCTATCC T TACCGGCAGGGC T GCC CAGGAGC TGCAGGACAAG 
CACCAGGAGCCC C TCG GGG TAGCTACTACCC TGGACCCCCC AATAGT GGAGGGCAGTATGG T AGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
T GGAGGGC CCTATGGACACCCCAAT CCTGGGAT GT T CC CCT C TGGAACT CCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
AT T GGTCT T CAT TCAATGATGAGACCT GCC T CATGATGAT AAACATGT T TGACAAGAC CAAGTCA 
P GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
f m \ CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 

m AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

If 1 

Iff GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
H GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 

C CAT GACAGC T T CTCGGATGC TATGACCCAACCATCTGTGGAGAGTGGAGT GCACCAGGGAC CT T 
TCCTGGCTTCT TAGAGTGAGAGAAGTATGT GGACAT CT CT TCTT TTCCT GT CCCTC TAGAAGAAC 



3 



m ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
HI AAT AG T GAGGAC CGG GGC T GAGGCC ACACAGAT AG GG GCC TGAT GGAGGAGAGGATAGAAG T T GA 



ru 



ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCAT GAGAGAGATCT G TCTATGGGACCAG T GGC TT GGATTC TGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
T CAGT CTCCAGGAGACAGTGG T CACCT CTCCCTGCCAATAC T T T T T T TAAT T TGCAT TT T T T T TC 
AT T TGGGGCCAAAAGT CCAGTGAAAT T GTAAGC T TCAATAAAAGGAT GAAAC TCT GA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLI^INMFDKTKSGRIDVYGFSALWKFIQQWKNLFQQYDR 
DRSGSISYTSLQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGN IRL S FEDFVTMTASRML 

Important features of the protein: 
Signal peptide; 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
jU CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
* GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 



en 

5 J5 R 

111 
111 



III 

o 



Q TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAAAAAAA?^AAAAAAAAAAAAAAAAAAAA 
fU AAAAAAAAAAAAAAAAAABAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 64 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 

DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPW 

LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 

PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 

WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



1^ Important features of the protein: 



Q Signal peptide: 

amino acids 1-26 

y = 

2 - 3 

f|| Casein kinase II phosphorylation sites. 

|!| amino acids 56-59, 155-158 

!>sss 

N-myristoylation sites. 

?« amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 

y a 

259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6* 



la I 

sirs 



b 

111 



p 

fu 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATC TAG GTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
ffl GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
W AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATT CACTTTTATATAT T TATATAAAAT TAG 

J*,] TAGTGAGATGTAAAAAAAAAAAAAAAAAA 

y 1 



ru 
c? 
ru 
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FIGURE 68 



MANPGLGL LLALGLP FLLARWGRAWGQ I QT T SANENS TVLP S S T S S S S DGNLRPE AI TAI I WFS 
LLAALLLAVGLALLVRKLREKRQTEGT YRP S SEEQFSHAAEARAPQDSKETVQGCLP I 



hi 



m 



Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 



* cAMP- and cGMP-dependent protein kinase phosphorylation site. 

13 

m 

rii 



amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites, 
amino acids 7-13, 26-32 
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FIGURE 6Q 



GCCAGGAATAACTAGAGAGGAAC AATG GGGT TAT TCAGAGGTTT TGTTTTC CT CTTAGT TCT GT GCCT GCT GCACCAG 
T CAAATACT TCCT TCAT TAAGCT GAATAATAAT GGCTTTGAAGATATT GTCATT GT T ATAGAT C CTAGT GT GCCAGAA 
GATGAAAAAATAATTGAACAAATAGAGGATATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGA 
TT TTTTTT CAAAAAT GTAT CTATATT AATT CCT GAGAATT GGAAGGAAAAT CCT CAGT ACAAAAGGC CAAAACAT GAA 
AACCATAAACAT GCTGATGTTAT AGTTGCACCAC CT ACACT C C CAGGT AGAGAT GAACCATACACCAAGCAGTT CACA 
GAATGTGGAGAGAAAGGC GAATACATTCACT T CACC C CTGAC CTTCT ACTT GGAAAAAAACAAAATGAATAT GGAC CA 
C CAGGCAAACTGT TTGT C CAT GAGTGGGCTCAC CTCC GGTGGGGAGT GTTT GAT GAGTACAATGAAGAT CAGCCTTTC 
TACCGTGCTAAGTCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCT GT CT TAGTAGAGCAT GCAGAATT GATTCTACAACAAAACTGTAT GGAAAAGATT GT CAAT T CT TT 
CCTGATAAAGTACAAACAGAAAAAGCAT CCATAATGTT TAT G CAAAGT ATT GATTCT GT TGTTGAATT TTGTAACGAA 
AAAACCCATAAT CAAGAAGCTCCAAGCCTACAAAACATAAAGT GCAATTTTAGAAGTACATGGGAGGTGATTAGCAAT 
TCTGAGGAT TTTAAAAACACCAT ACC CAT GGTGACAC CACCT C CTC CAC CT GTCTTCT CATT GC TGAAGAT CAGTCAA 
AGAATTGT GTGCT TAGTT CTT GATAAGT CTGGAAGCAT GGGGGGTAAGGAC CGCCTAAATCGAATGAAT CAAGCAGCA 
1 KS( AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCT GGGT GGGGAT GGTT CACTT T GATAGTACTGCCACTATTGTA 

1*1 AATAAGCTAAT C CAAATAAAAAGCAGTGAT GAAAGAAACACACT CAT GGCAGGATTACCTACATATCCT CT GGGAGGA 

f j ACTTCCAT CTGCT CTGGAATTAAATATGCAT TT CAGGT GATT GGAGAGCT ACATTCCCAACT CGATGGATC C GAAGTA 

#11 CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
Jw J TTTATTGCT TTGGGAAGAGCT GCT GATGAAGCAGTAATAGAGAT GAGCAAGATAACAGGAGGAAGT CATTTTTAT GTT 

||| T CAGATGAAGCT CAGAACAAT GGCCT CATT GAT GCTTT TGGGGCTCT T ACATCAGGAAATACT GATCT CTC CCAGAAG 

g p T CCCTTCAGCTC GAAAGTAAGGGATTAACACTGAATAGTAAT GC CT GGAT GAACGACACTGT CATAAT T GAT AGT ACA 

y_} GTGGGAAAGGACACGTT CTTT CT CAT CACAT GGAACAGTCT GCCTC CCAGTATT TCT CT CT G G GATCCCAGT GGAACA 

ATAATGGAAAATT TCACAGTGGATGCAACTTCCAAAAT GGC CT ATCT CAGTATTCCAGGAACT GCAAAGGT GGGCACT 
T GGG CATACAAT CTTCAAGCCAAAGCGAACCCAGAAACATT AACTAT T AC AGTAACT T CTCGAGCAG CAAATTCTT CT 
GT GCCTCCAATCACAGT GAAT GCTAAAAT GAAT AAGGACGT AAACAGTTT C CCCAGCC CAAT GATTGTT TACGCAGAA 
9 AT TCTACAAGGATAT GTACCT GTTCTTGGAGCCAAT GTGACT GCTTT CATT GAAT CACAGAAT GGACAT ACAGAAGTT 

pi TTGGAACTT TTGGATAATGGT GCAGGCG CT GATT CT TTCAAGAATGAT GGAGTCTACT CCAGGTAT TT T ACAGCATAT 

fjf ACAGAAAAT GGCAGATATAGCTTAAAAGT T CGGGCT CAT GGAGGAGCAAACACT GCCAGGCTAAAATTACGGCCTC CA 

||l CT GAATAGAGCC GCGT ACATACCAGGCT GGGTAGTGAACGG GGAAATT GAAG CAAAC CCGC CAAGAC CT GAAATT GAT 

h\ GAGGATACT CAGACCACCTTGGAGGATTT CAGCC GAACAGCAT C CGGAGGT GCATTT GTGGTAT CACAAGT C CCAAGC 

CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
flj ACATGGACAGCACCAG GAGATAATTTTGAT GTT G GAAAAGTTCAACGT TAT AT CATAAGAAT AAGTGCAAGTATT CT T 

fi% GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCA^ 

AGCTTTGCATTT AAACCAGAAAATAT CTCAGAAGAAAAT GCAACCCAC AT ATTT ATT GCCAT TAAAAGT AT AGAT AAA 
AGCAATTT GACAT CAAAAGTATCCAACAT TGCACAAGTAACTT T GTTTATCCCT CAAGCAAATC CTGAT GACATT GAT 
CCTACACCTACT CCTACTCCTACTCCTACTCCT GATAAAAGTCATAAT T CT GGAGTTAATATTT CTACGCT GGTATTG 
T CTGT GATT GGGTCTGTT GTAATTGT TAACT TTATTTTAAGTAC CACCATT TGA ACCT TAACGAAGAAAAAAATCT TC 
AAGTAGACCTAGAAGAGAGTTTT AAAAAACAAAACAATGTAAGTAAAGGAT ATT T CT GAAT CTT AAAATTCATCCCAT 
GTGTGATCATAAACTCATAAAAATAATTTTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATA 
T GTAAAAACTGT CAAGATTAAAATTTAATAGTT T CAT TTATTT GTTATTT T ATTT GT AAGAAAT AGTGATGAACAAAG 
ATCCTTTTTC^TACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCM 
GAAATTAAAATCAT CTAT CTGAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACAT TT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



i n 
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FIGURE 70 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKOTSILIPENWKENPQYKRPKHENHPCHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGS CLSRACRI DS TTKL YGKDCQFFPDKVQTEKAS IMFMQS I DS WE FCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMTOFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANS SVPP I TVNAKMNKDVNS FPSPMIVYAE I LQGYVPVLGANVTAFI ESQNGHTEVLELLDNGA 
GADS FKNDGVYSRYFTAYTENGRYS LKVRAHGGANTARLKLRPPLNRAAY I PGWVWGE IEANP P 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRI SAS I LDLRDS FDDALQVNTTDLSPKEANSKESFAFKPENI SEENATHI FIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



C T CCTTAGGTG GAAACCCT GGGAGTAGAGT ACT GAGAGCAAAGACCGGGAAAGACCATACGT CC CCGGGCAGGGGT GA 
CAACAGGT GTCATCTT TTT GAT CTCGTGT GTGGCTGCCTTCCTATTT CAAGGAAAGACGCCAAGGTAATTTTGACC CA 
GAGGAGCAATGATGTAGCCACCT CCTAACCTT CCCTT CTTGAACCCCCAGT TAT GCCAGGAT TTACTAGAGAGTGT CA 
ACTCAACCAGCAAGCGGCT CCT T CGGCTT AACT T GT GGTTGGAGGAGAGAACCT TTGT GGG GCT GCGT T CT CT TAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCT GTGAT GGGAAT GAAGGTGAAAACTT GGAGATTT CACT TCAGTCATT GCTT CTG CCTGC AAGAT CATC CTTT AAA 
AGTAGAGAAGCT GCT CTGT GTGGTGGTTAACT C CAAGAGGCAGAACT C GTT CTAGAAGGAAATGGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATQATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CT GGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTT CAGGAGT GGGAGGAGCAGCAC 
CGCAACTACGT GAGCAGCCTGAAGCG G CAGAT CGCACAGCT CAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 
AATGGGCAGTACCAAGCCAGCGATGCT GCTG GC CTGGGTCT GGACAGGAGCCCC C CAGAGAAAACCCAGGCC GACCT C 
CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG 
CCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCTGTG 
AGGAAGGAC AAGCGGGATGAGTT GGT G GAAG CCATT GAATCAG C CTT GGAGACC CTGAACAAT C CT GCAGAGAACAGC 
CCCAATCACCGT CCTT ACACGGC CTCT GATTT CATAGAAGGGAT CTAC CGAACAGAAAGGGACAAAGGGACATTGTAT 
GAGCT CACCTT CAAAGGGGACCACAAACACGAATTCAAACGGCT CAT CTTATTT CGAC CATT CAGC CC CATCATGAAA 
1 1| GT GAAAAAT GAAAAGCTCAACAT GGCCAACAC GCTTAT CAATGTTAT CGTGCCT CTAGCAAAAAGGGT GGACAAGTTC 

g» CGGCAGTT CAT GCAGAATTTCAGGGAGAT GTGCATT GAGCAGGATGGGAGAGTC CAT CTCACTGTTGT TTACT TT GGG 

■M I AAAGAAGAAATAAATGAAGTCAAAGGAAT ACTT GAAAACACTT C CAAAGCT GCCAACTTCAGGAACTT T AC CT TCAT C 

|Jl CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 

T TTTT CTGT GAT GTGGACATCTACTT CACATCT GAAT TCCT CAATAC GTGTAGGCTGAATACACAGC CAGGGAAGAAG 
GT ATTTTAT CCAGTTCTTTTCAGTCAGTACAAT C CT GGCAT AATATACGGC CAC CAT GATGCAGTCC CT CCCT TGGAA 
$ CAGCAGCT GGT CATAAAGAAGGAAACTGGATTT T GGAGAGACTTTGGATTT GGGATGACGT GTCAGTAT CGGTCAGAC 

TTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 
CACAGCAACCT CATAGT GGTACGGACGCCT GT GCGAGGACT CTT CCACCTCTGGCAT GAGAAGCGCTGCAT GGACGAG 
HI CT GACCCC CGAGCAGT ACAAGAT GTGCAT GCAGT CCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGGGCATGCTG 

p % GT GTT CAGGCACGAGATAGAGGCT CACCT T CGCAAACAGAAACAGAAGACAAGT AGCAAAAAAACATGAACT CCCAGA 

%zi GAAGGATT GTGGGAGACACTTT TT CTTTC CTTT T GCAATTACT GAAAGTGGCTGCAACAGAGAAAAGACTT CCAT AAA 

f| | GGAC GACAAAAGAATT GGACTGATGGGT CAGAGATGAGAAAGCCTCCGATTTCT CTCTGTT GGGCTTTTTACAACAGA 

**? g AATCAAAATCTCCGCTTTGCCTGC^W\AGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 

AGATT ATAAGCCTAAT GGT G TGGAGGTTT T GAT GGT GTTTACAATACACTGAGACCT GTTGTTTTGTGT GCTCATT GA 
[f || AATATTCAT GATTTAAGAGCAGTT TT GTAAAAAATT CATT AGCATGAAAGGCAAGCATATT T CT CCT CATAT GAAT GA 

GCCTAT CAGCAGGGCT CTAGTTT CTAGGAATGCTAAAATAT CAGAAGGCAGGAGAGGAGATAGGCTTATTAT GAT ACT 
AGTGAGTACATTAAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAACCATAAATATCGT GT CAT ATTT T CCC CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
T TTTTTCC CTTGT GAGTTATAGT CTGCTTATTTAAT TACCACTTTGCAAGCCTTACAAGAGAGCACAAGTT GGCCTAC 
ATTTTTATATTTT TTAAGAAGAT ACTTTGAGAT GCATTATGAGAACTTTCAG TT CAAAG CAT CAAATT GAT GCCAT AT 
C CAAGGACATGC CAAAT GCTGATTCT GTCAGGCACT GAAT GTCAGGCATT GAGACATAGGGAAGGAAT GGTTT GT ACT 
AATAC^GACGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGAC 
ACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAGAAACCACATTTT 
CTCCTTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGG GTGCT CCT CCTGGCTTCT GGCT TCCATAAGAAGAAATGGAGAAAAAT AT ATAT ATAT ATAT AT ATAT T GT 
GAAAGATCAATCCAT CT GCCAGAATCTAGTGGGATGGAAGTTT TTGCTACATGT TAT CCAC C CCAGG C CAGGTGGAAG 
T AACT GAATTATTTTTTAAATTAAGCAGT TCTACTCAAT CAC CAAGAT GCTTCT GAAAATT G CATTTTATT ACCATTT 
CAAACTAT TTTTTAAAAAT AAATACAGTT AACATAGAGT GGTT T CTT CATT CAT GTGAAAATTATTAGCCAG CACCAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC 
ATTCAAGCT GTT GGT GTGTTAAAAAATGCAT TGT AT T GATT TGTACT GGTAGTT T AT GAAAT T TAATTAAAACACAGG 
C CAT GAAT G GAAG GT GGTAT TGC ACAGCT AATAAAAT AT GATT TGT GGAT AT GAA 



13 
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FIGURE 72 



MMMVRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRAN 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVINIAGVKLATEYAAVPFDSFTLQKVYQL^^ 

ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 
TLIWIVPIAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
|n* NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
CI KYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
W RKQKQKT S S KKT 

m 
la 

1 1| Important features: 

||| Signal peptide: 

IM amino acids 1-27 

1^ N-glycosylation sites. 

f : | amino acids 315-319, 324-328 

HI 

M N-myristoylation sites. 

amino acids 96-102, 136-142, 212-213, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 



GAGAC TGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT TT GTCC TGGGGAT CCA 
GAAACC CATGATACCC TAC T GAACACC GAATCCCC TGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCAT CGATATTCAGACAGACAGTGTGACATTT GACCCTGATT TGCC 
j»>% TGCTC TGCAGCCCCAC GGATATGACCAGCC T GGCAC CGAGC CT T TGGACCT GCACAACAAT GGCC 

p ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
CP GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 

T GAAGCCACAT T T GCAGAGC TCCACAT TGTACAT TAT GACTCTGAT TCCTAT GACAGC T T GAG T G 
f«: AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 

y i 

|4 AAT ATAGC T TATGAAC ACAT TCT GAGT CAC T TGCAT GAAGTCAGGCATAAAGAT CAGAAGACCT C 

9 AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
Gl GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
^' AT T T CAAT GGAACAG C T GGAAAAGC T T C AG GG GACAT T G T T CT C C AC AGAAGAG GAGC C C T C T AA 

GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
p TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
R| TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 

C CGAAAGAGTGT GGTC TTCACC TCAGCACAAGCCAC GACT GAGGCATAAAT TCCT TCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCC TGGACATCTCT TAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 

CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 7 A 



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 




amino acids 1-15 



Transmembrane domain: 



amino acids 291-310 



N-glycosylation site. 



m 



amino acids 213-216 



Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7S 



ill 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
T C T GGTGG T T TGCCTAAACC TGCAAAC ATCACCT TCT TAT CCATCAACATGAAGAATGT CCTACA 
ATGGACTCCACCAGAGGGTCT TCAAGGAGTTAAAGTTACTTACACTGTGCAGTAT TTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCT TCCTGTTT CCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
T CT GT GATGGGCTAT TCCATC TACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATT T 
GAT T T T GAT T TATGGAAATGAAT T TGACAAAAGAT T C T TT GT GCC TGCT GAAAAAATC GTGAT TA 
ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
AAAAGCAGT GAT GTAT CCAGCCT TAAT GATCCTCAGCC CAGCGGGAACC TGAGGCCCCCTCAGGA 
GGAAGAGGAGGT GAAACATT TAGGGTATGC T TCGCAT T TGAT GGAAAT T TT T TGTGAC TCTGAAG 
(3 AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
11! ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA 

O GCTCAGT T TGCAGGAGGAGGTGT CCACACAAGGAACAT TAT TGGAGT CGCAGGCAGCG T TGGCAG 

fi I 

PI TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
III CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACC TATC TCATGCAAT TCAT GGAGGAAT GGGGGT TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGC AGTCAATAAAT G T T TGCCAGAC TGGGTGCAGAAT TTAT TCAGGT GGG T G T 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTN^ 

DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
I VINF I TLN I SDDSKI SHQDMSLLGKSS DVS SLNDPQPS GNLRPPQEEEEVKHLGYASHLME I FC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glyoosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGC TGAAGGACCACAACGCCACCAGCATCCT GCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
8 : GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
p| T CGTGGAGTTCCACAT GACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGT GGC 

Q CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
IP TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
jjj CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
S| CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
|7 TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
s AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
13 TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 

in 

;2 ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
Sg T GATCAATGAAAAGGC TGCAGATAAGC TGGGATC TACCCAGATCGTGAAGATCC TAACTCAGGAC 

r ill 

f*j ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
f|| TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 

TTTACACCAAAGGTGACCAACT TATACTCAAC TTGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 



MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVXGSLWTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMT TEAQAT I RMDT SAS G PTRLVLS DCAT S HGS LRI QLL YKL S FLYNALAKQVMNLLVPS LPNL 
VKNQLC PVIEAS FNGMYADLLQLVKVP ISLSI DRLE FDLLYPAI KGDT I QL YLGAKLLDSQGKVT 
KWFNNSAASLTOPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 




Important features of the protein: 
Signal peptide: 



amino acids 1-21 



•Pi; 



N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 




Glycosaminoglycan attachment site. 



amino acids 412-415 



LBP / BPI / CETP family proteins. 



amino acids 407-457 
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FIGURE 7Q 



n 



GAGAGAAG TCAGCCT GGCAGAGAGACT CTGAAATGAGGGAT TAGAGGT GTT CAAGGAGCAAGAGC 
TT CAGCCT GAAGACAAGGGAGCAG TCC CTGAAGACGC TT CTAC TGAGAGGT CTGCCATGGCC TCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
H GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 

GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACT GACT TTGGCCATT GGAT TGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATTTGTGCATGAACTGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGC CT GGGACAT T T AAAAAAATA 



||| 

5 
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FIGURE 80 

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YSTLLGL PADI QAAQAMMVTS SAI S SLAC 1 1 S WGMRCTVFCQESRAKDRVAVAGGVFFI LGGL 
LGFI PVAWNLHGILRDFYSPLVPDSMKFE IGEALYLGI ISSLFSLIAGI ILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 



Important features of* the protein: 
, Signal peptide: 

f»l amino acids 1-24 

Transmembrane domains: 

?H amino acids 82-102, 117-140, 163-182 

in 

I?! 

y. N-glycosylation site. 

$ amino acids 190-193 



PMP-22 / EMP / MP20 family proteins. 



h'| amino acids 4 6-59 



fll 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
C CATGGGAAGAGATAC TCCCCCGGCGAGAGCTGGCACCCCTACT TGGAGCCACAAGGCC T GAT GT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAA.TCGGATGAAGAGGACA 
GTGTGCAGTCGCT CCATGGGGT GAGACAT CCTCAGGAT CCATGT T CCAGT GAT GCTGGGAGAAA.G 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
C AGACCCAAGGGAGCAGGCAGCACAAC TGTCAAGAT CGTC C TGAAGGAGAAACATAAGAAAGCC T 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGAC T TCCAGAAAGAGGCACAGCAC T T CCGACT GC TCGC TGGC CCCCACGAAGGT C ACT 
GGAACGTCT TCCTAGCCCAGACCCT GGAGCTGAAGGTCACGGCCAGT CCAGACAAAGTGACCAAG 
ACATAACAAAGACC TAA CAGTTGCAGATATGAGCTGTATAATTGT TGT TAT TATATAT TAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAA^AAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSS 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
IS STRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVE I YLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 



|p amino acids 1-25 
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FIGURE 83 



si 



5fS 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTT T C CATGCCGT GAGGTCCATTCACAGAAC AC AT CC ATGGCTCT CATGCTCAGTTT GGT T CTGAGT C 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGG ACAAAACT GGTGAAGGATTCTATTGCGGAGGGGCGCATCT CTCTGAGGCTGGAAAACATT ACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
f*l AC AGGT GT C AGC ACTGGGCT CAGT TCCT CTCATTTCCATCACGGGAT ATGTT GATAGAGACAT C C AGCT AC 

Q TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 

ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
rr T T TT TT GGC AT T GT T GGACT GAAGATT T TCTTCTCC AAAT TCCAGT GGAAAATCCAGGCGGAAC TGGACT G 

GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 

X 

f*% AGAC GGCTCACC CGAAGC T CTGCGTTT C TG ATCTGAAAACTGTAACCC AT AGAAAAGCTC CC C AGG AGGT G 

10 CCTC ACTCTG AG AAGAGAT T T ACAAGGAAG AGTGTGGTGG CTTCTCAGAGTT TC CAAGCAGGGAAACAT T A 

Q CT GGGAGGTGGAC GGAGGAC ACAAT AAAAGGTGGCGCGTG GGAGTGTGCCGGGATG AT GT GGAC AG GAGGA 

ftj AGGAGT AC GT G ACT TT GT CTCCCGATCATGGGTACT GGGT CC TCAG AC TGAATGGAGAAC AT TTGT ATTTC 

O ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
t^l TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 

AAGGCT T ATTG AG G CC CT AC ATTGAGTATCCGTCCT ATAATGAGC AAAAT GGAACT CCCAT AGTC ATCT GC 
CC AGTC AC CC AGGAAT CAGAGAAAGAGGCCTCTTGGCAAAGGGCCT CT GC AATCCC AG AGAC AAGC AACAG 
TGAGTC CT CCT C AC AGGC AACCACGCCC TTCCTCCC CAGGGGTG AAAT GTAGG ATGAAT C AC ATC CCACAT 
TC T T CT T T AGGG AT AT T AAGGTCT CTCT CCCAGATCCAAAGTCCCGCAGCAGCCGGCC AAGGTGGCTT CCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATT ACATTTAGTTTGCTCT CACT CCATCTGGCTAAGTGATCTT GAAATACCACCTCT CAGGTGAAG 
AACC GT C AGG AATT CCCAT C TCAC AGGCTGTGGTGT AGAT T AAGT AGACAAGGAATGT GAAT AAT GCT TAG 
AT CT T AT TGAT G ACAGAGTGTATC CTAATGGTTT GT TCAT T ATATT ACACT T TCAGT AAAAAAA 
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FIGURE 8a 



w 

5 amino acids 239-255 

jKRS. 



MALMLS LVLS LLKLGS GQWQVFGPDKPVQALVGEDAAFSC FLS PKTNAE AMEVRF FRGQ FS SWH 
LYRDGKDQPFMQMPQYQGRTKLVKDS IAEGRI SLRLENI TVLDAGLYGCRIS SQS YYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKE YVTLSPDHGYWVLRLNGEHLY FT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PI VI CPVTQESEKEAS WQRASAI PE TSNSES S SQATT P FLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 
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FIGURE 8 s 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGA CATGC TGCTGCTGCTGCTGCCCCT 
GC T C TGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAAC TGC TGACGAT GCAGAGT TC CGT GA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT TGCACCC TGAGCAT CAGAGATGCCAGAAGAAG TGATGCGGGGAGA 
j m , TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
h% TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
fl AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 

CP GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 

M 

fZ GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

II! 

|p ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
1^ GACGGCACAGTAT CCACAGTCT TGGGAAATGGCTCATC TC TGTCAC T CC CAGAGGGCCAGTCT CT 

* GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
O GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
jlj CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
Q CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
jfU TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AG CC T C T C AGG GG C CC C T GAC T GAACC T T G GG CAGAAG AC AG T CC CC CAGACCAGC C T CC CC CAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGC TGAT TC TTGTAGAAT TAACAGC CC TCAACG T GATGAGC TATGATAACAC TATGAAT T ATG 
T GCAGAGTGAAAAGCACACAGGCTT TAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
T CCCT TT TATT TT TT TAACTAAAAGACAGACAAATT CCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFKMEKGSIPCWOT 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVS TVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFWVRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 



^ amino acids 1-15 



Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
C CAAGGAAAGTGCAGC TGAGACT CAGACAAGAT TACAATGAACCAACT CAGCT TCC TGCT GT T TC 
T CATAGCGAC CACCAGAGGATGGAGTACAGATGAGGC T AATACT TACT T CAAGGAATGGACC T GT 
T CT TCG TC T CCATCTCTGCCCAGAAGCT GCAAGGAAAT CAAAGACGAAT GT CC TAGTGCATT TGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
^ CAAC TACAACAC C T T T GGAT C T G CAGAG GC GGCCAC GAGCGAT GAC T ACAAGAACC C TG G C T AC T 

pi ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
111 AGAAACAGCTCC C T GC TGAGG TACCGCACGGACAC TGGCTT CCT CCAGACAC T GGGACATAAT C T 

W GT TTGGCATC TACCAGAAATAT CCAGT GAAATATGGAGAAGGAAAGTGT TGGAC TGACAACGGCC 

Jf! CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
|7 GGC CAGCGGGAAT T CAC TGCGGGATT TGTTCAGTTCAGGGTAT T TAATAAC GAGAGAGCAGCCAA 

I CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

GAT AC T T TCCAGAGGCCAGTCCC CAGCAGTGTGGAGAT TT T TCTGGT T T TGAT TGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAACT TACCCAGTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATAT T GAC T CAAGA 
AAAAAA 



13 

m 



'til 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTSNGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
E GKCWTDNGPVI PWYD FGDAQKTAS YYS P YGQRE FT AG FVQ FRVFNNERAANALCAGMRVT GCN 
TEHHC I GGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSS SRE I TEAAVLLFYR 



|^ Important features: 

Signal peptide: 

n 

III amino acids 1-16 

HI N-glycosylation site. 

Iff amino acids 163-167 

N* 

Glycosaminoglycan attachment sites. 

Q 

HI amino acids 74-78, 289-293 

ffj N-myristoylation sites. 

*** amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
P TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTCATTACCTTCA 
O TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 

S! GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 

IP 

hi TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 

W 

HI T TAAGACT TATATACAGT T T TAGGGGACAAT TAAAAAAAAAAAA 

in 

at 

MS 

0 

m 

S 

■; s ; 
- IT 
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FIGURE QO 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 



111 

!?;;§ 



m 



Q 



ru 



Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 



App_ID=10063551 



Page 240 of 3 18 



FIGURE 01 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AG T TCCACCAGGAACAGCAGATC T TCATCGAAGCCAAGGCC T CCAAAAT CT TCAAC TGCC GGAT G 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCAGGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGG TGAC CCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAG TGGGCTCT C TGTGCAGCC T CACAGGGCTT T GCCACGGAGCCACAGAGAGATGCT GGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE 02 



MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGH IS PKSRPMANS TL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNIS I SLVPPSKAVE FHQEQQI FIEAKASKI FNCRMEWEKVERGRRT SLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2 -oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
y TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
111 AAGCCTGAAA.CTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
W AACCTCAGGGAACCAGCACT TCCCAAACCGCAGACTACATCT TTAGAGGAAGCACAACTGTGCCT 



IP 



TT TTC T GAAAAT CCCT T T TTC TGGT GGAAT TGAGAAAGAAATAAAAC TATGCAGATA 
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FIGURE QA 

MTAAVFFGCAFI AFGPALAL YVFT IAIEPLRII FL I AGAFFWL VS LL I S SLW FMARVI I DNKDG 
PTQKYLLI FGAFVSVYI QEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 



If! 



m 

iri: L 

n| 



Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 
Glycosaminoglycan attachment site. 



* amino acids 120-123 



Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Q5 



AATT T T TCACCAGAGTAAAC TT GAGAAACCAAC TGGACCTT GAGTAT T GTACAT T T TGCCT CG TG 
GACCCAAAGGTAGCAATC TGAAAC ATGA GGAGTACGATTCT ACT GTT TTGTC TT CTAGGAT CAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCT TCTT TAAGTCTGATACCATTAACA 
CAGAT GCTCACAC TGGGGCCAGAT CT GCAT CTGT TAAAT CCTGC T GCAGGAATGACAC CTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGT T GAAT GTACAACAGCAAC TGCACCCACAT GTGT TAC 
CAAT T TTTGTCACACAAC T T GGAGCC CAGGGCACTAT CC TAAGC TCAGAGGAAT T GCC ACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCT CCCAACT CCCAGTGGCACAGAT GACGACT TTGCAGTGACCACCCCT 
Jp GCAGGCATCCAAAGGAGCACACAT GCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAAT TCA 

iri G TAAG CTG TT T CAAAT T T T T TCAACTAAGCTGCCT CGAAT T TGG T GATACATG T GAAT CT T TATC 

|r,is AT TGAT TATATTATGGAATAGAT T GAGACACAT T GGATAG TC TTAGAAGAAAT T AAT T CT TAAT T 

$ TACCT GAAAATAT TC T TGAAAT T T CAGAAAATATG TT CTATG TAGAGAATCCCAACTT T TAAAAA 

CAATAATTCAATGGATAAAT CT G TCT T TGAAATATAACAT TATGCTGCCTGGAT GATATGCATAT 
TAAAACATAT T TGGAAAAC T GGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA& 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



ill 



p 

m 
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FIGURE q6 



pi 



1,1 

rn 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
T DDD FAVT T PAG I QRS T HAI EE AT TE SANG I Q 

Signal peptide: 
amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
H. CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
Q ACCAGCCT T GACCACGCCC TGGAGAGGACC TACCAGC TGT T GGTACAGGTCAAGGACATGGGTGA 

;r| CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
S| CCCTAGAGCCTATCCACC T GGCAGAGAATC TCAAAGT CCTATACCCGCAC CACATGGCCCAGG TA 

fl CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
* TGCAGAGGGAAACC T C TACGT GACCAGAGAGCT GGACAGAGAAGCCCAGGCTGAG TAC CT GC TCC 

III AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
pi ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 

CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
* m CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
f3 CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
HI TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
,p| ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
}j? CC T CGAGC C CGC CTTCCGCCT CAT GGAT T T T GC CAT T GAGAG GGGAGACACAGAAG GGAC T T T TG 

«jf GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
GJ GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
|I| AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 

ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCT CTGCACACCCCGCCAAGAC CATGGC T TGATCGTGAGT GGAC CCAGCAAGGACC CC GATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T CACCCAC TGGACCAT G T CAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCC CT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCT GGGAGAGAGC CCAGCACC CAAGATCCAGCAGGGGACAGGACAGAGTAGAAGC CC C TCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE o8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDWPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQS VAKLVGPGPGPGATATVTVLVERVMPPPKLDQE S YEAS VP I SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDIiASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI IPVWSHNAQMWQLLVRVI VCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAI GI FLI 
LIFTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 



App_n>10063551 



Page 248 



FIGURE qo 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATTCT TCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGCACCAGCCTT TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCTTGCTTTCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
T TCT C CT TATGTT TGG TCTACT AT TGCAT T TAGAAGCTGCAACAAAT TCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGTTCCATACAACCTCCAGTGGGATCAGCACAGCCACCAACTCTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACT CT GAGT CCAGCACACCCT CCAGTGGGGC CAGCACAGTCACCAACT 
U CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
fn AGT AGGGCCAGCACTGCCAC CAAC TC TGAGTCTAGCACACTC TCCAGT GGGGCCAGCACAGCCAC 

CAACT CT GAC TCCAGCACAACCTC CAG TGGGGC TAGCACAGCCACCAAC TCTGAGT CCAGCACAA 
W CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
CI! GCCAC CAAC T CTGAGTCCAGCACAAC C TCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 

||| AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
||i GCACAGCCACCAAC TC TGACTCCAGCACAGTGT CCAGTGGGGCCAGCAC TGCCACCAACT CT GAG 

r« TCCAGCACGACC TCCAGTGGGGCCAGCACAGCCAC CAACT CT GAGTCCAGCACGACC T CCAGT GG 

GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
H CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
, AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
p| CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
;!? CC TCCAGTGGGG T CAGCACAGCCACCAAC T C TGAG TCCAGCACAAC CTCCAG TGGGGCTAGCACA 

H) GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
%3 CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
f|| ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
?4 ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
Jjt GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
'** TCTTCTTCTGTGT GAGAAACAGCCTGTCCCTGAGAAACACC T TTAACACAGC TGT CTACCACC CT 

CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
G T GGAG TCCTAACTGG TTC T GGAGGAGACCAGTAT CAT CGATAGCCAT GGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCC CCGGAAGCAAGTGCC GCAT TCT TCAGGAAGGAAGAGACCT GGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCT TGAAGAAGGTATTCCTCACCT TTCTTGCCTT TACCAGACAC TGGAAAGAGAAT ACTATAT 
TGCTCATT TAGC TAAGAAAT AAATACATC TCAT CTAACACACACGACAAAGAGAAGCT GT GCT T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETS TSANTGSSVI SSGASTATNSGSSVTSSGVSTATI SGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSES S T T S SGAS TATNSE S S TVS S RAS TATNSE S S TT SS GAS TATNSE SRT T SNGAGTATNS E S 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNS ESS TVS S G I S TVTNS E S S T PS S GANTATNS E S S T TS S GANTATNSES S TVS S GAS TA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSAS TAVSEAKPGGSLVPWEIFLITLVSWA 
AVGL FAGL FFCVRNS L S LRNT FNT AVYHPHGLNHGLGPGPGGNHGAPHRPRWS PNWFWRRPVS S I 
AMEMSGRNSGP 



Signal peptide: 

jm.it amino acids 1-20 



Transmembrane domain: 

amino acids 510-532 
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FIGURE 1Q1 

GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGAT GT GATATATGCAT TCCAGGGGAAGGGAAAT TGTGGTGCTTCT GAACCCATGG T CAAT T 
AACGAGGCAGTTTCTAGCTACTGCACGTACT TCATAAAGCAGGACTCTAAAAGCT T TGGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TATAAC TGGGGAT GCAT TT GT TCCTGGAGAAAGAAGTGTCAT TAT CATGAACCATCGGACAA 
GAAT GGAC T GGAT G T TCC T GT GGAAT TGC C T GATG CGAT AT AG C T AC CT C AGAT T G GAGAAAAT T 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CT T CAT T CATAGGAAATGGAAGGATGACAAGAGCCAT T T CGAAGACATGAT TGATT ACT T TTGTG 
AT AT T CACGAACCACT TCAACT CCTCATAT TCCCAGAAGGGACTGATC TCACAGAAAACAGCAAG 
TCT CGAAG TAAT GCAT T TGCTGAAAAAAATGGACT TCAGAAAT ATGAATATG T T T T ACAT CCAAG 
AACTACAGGCTT TAC T T T TGTGG T AGACCGTCTAAGAGAAGGT AAGAACCT TGATGCT GT CCATG 
ATATCACTGTGGCGTATCCT CACAACAT TCC TCAATCAGAGAAGCACC T CC TCCAAGGAGAC TT T 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAAT T T T TAT T T TACCGGACAGAG TGTCAT TCCACCT TGCAAGTC T GAAC T CAGGG TC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAAT AT T TGGTGGACTGGAGATCATAGAACT TGCATGT TACCGACT T T TACACAAACAGCCACAT 
T TAAATT CAAAGAAAAATGAGTAAGAT TATAAGGTT T GCCATGTGAAAACCTAGAGCATATT T TG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCC AT CAT TAT T T GT TAAAGATAT T TTGCAC T TAAT T T T GTGGGAAAAATAT T GCTACAAT T T TT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATAT TAT TAAACAAT CATCAGGC TTT TAAA 
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FIGURE 102 



MHSRGREIVVLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGS FFGSIFMLSP 
FLPLMFVNPS WYRW INNRLVAT WLTLPVALLETMFGVKVI I TGDAFVPGERS VI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHPCRWEEKEERLRSFYQGEfCNF 
YFTGQS VI PPCKSELRVLWKLLS ILYWTLFS PAMCLLI YL YSLVKWYF I IT I VI FVLQERI FGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 1Q3 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CAC CT CT ACAGAGAT GGGGAAGAC TGGGAATC TAAGCAGAT GCCACAGTATCGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGAT T TGT CT T CAGAC TCCAGAGCAAATGCAGATGGG T ACAGCCT GTATGATGTGGAGATCTCC 
AT T ATAGT C CAGGAAAATGC T GGGAGCATAT TGTGT T CCAT CCACC TTGCTGAGCAGAG T CATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAATCCAAAGGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCAGAATTGAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
T T TCT GATCTGAAAAC TGTAACCCAT AGAAAAGC TCC CCAGGAGGT GCC TCACT CT GAGAAGAGA 
T T TACAAGGAAGAGTG TGGT GGC T TC TCAGGGT T TCCAAGCAGGGAGACAT TACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACT AT GAGGGTGGGACCATCTC CT TC TT CAATACAAATGACCAG TCCC T TATT TAT AC CCTGC 
TGACATGTCAGTTTGAAGGCTTGT TGAGACCCTATAT CCAGCATGCGATGTATGACGAGGAAAAG 
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCT CCCGACAGGTGGCCCCAGCT T CCTCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGAC.T CACAT TAGGTTTAGT TTGT GAAAA 
C T CCAT CCAGCTAAGCGATC T TGAACAAGTCACAACC TC CCAGGCT CC TCAT T T GCTAG T CACGG 
ACAGT GAT T CCT GCC TCACAGGT GAAGATTAAAGAGACAACGAAT G TGAATCAT GCT T GCAGGT T 
T GAGGG C AC AG T G T T T GC TAAT GAT G T GT T T T TAT AT TATACAT T T TC C CAC CATAAAC T C T G T T 
TGCTTATTCCACATTAATTTACTTTTCTCTATACCAAATCACCCATGGAATAGTTATTGAACACC 
TGCTT T GTGAGGCTCAAAGAATAAAGAGGAGGTAGGAT T TT TCAC TGAT T C TAT AAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT TAAC ACAGACACAAAAAT T C T AAATAAAAT T T TAAC AAAT TAAAC TAAACAAT AT AT T TA 
AAGAT GATATATAAC TACTCAGT GTG G T TTG TC CCAC AAAT GC AGAGT TGGT T TAAT AT T TAAAT 
AT CAACCAGTGTAAT TCAGCACAT TAAT AAAGTAAAAAAGAAAAC CAT AAAAAAAAAAAAAAA 
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MAFVL ILVLS FYELVSGQWQVTGPGKFVQALVGEDAVFSCS LFPETSAEAMEVRFFRNQFHAVVH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WASQG FQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLS PNNGYWLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 




Signal peptide: 



amino acids 1-17 



Transmembrane domains: 



amino acids 131-150, 235-259 



p 
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CCTTCACAGGACTCTTCATTGCTGGTTGGCA ATGA TGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACTGT TCAT T ATGTGAGAT AT AATCAAAAGAAG ACCT ACAAT T ACTAT AGC AC ATTGT CAT TT AC AAC 
TGAC AAACTAT ATGCT GAGT TT GGC AGAGAGGCT TC T AACAAT T TT AC AGAAAT GAGCCAGAGACT TG AAT 
CAAT GGTG AAAAAT GC ATT T TATAAATCTCCAT T AAGGGAAGAATT TGTCAAGT CTC AGGT TAT C AAGTTC 
AGTC AACAGAAGCATGGAGT GT TGGC TCAT AT GCT GTTGAT T TGT AGATTTCAC T CT ACTGAGGAT CCTGA 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
AT CC TC ACTCAGTT AAAATT AAAAAAATCAACAAGAC AGAAA.CAGACAGCTAT CTAAACCAT TGCT GCGG A 
AC AC GAAG AAGT AAAACTCT AGGT C AGAGT CT CAGG ATCGTT GGTGGG ACAGAAGTAG AAGAGGGT GAATG 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TT GT GAGT GC TGCT CACTGT TT T ACAACAT AT AAGAACCCTGCC AG AT GGAC TGCTTCCTTTGGAGT AAC A 
AT AAAACCTT CGAAAATGAAACGGGGTCTCCGGAGAAT AATT GT CC AT GAAAAAT AC AAACACCCATCACA 
T GAC T ATG AT ATTT CTCTT GCAGAGC TTTCTAGCCC TGTTCCCT AC AC AAAT GC AGTACATAGAGTTTGTC 
TCCC TGAT GC ATCCT ATG AGTT TC AACC AGGT GAT GTG ATGT TT GTGACAGGAT TTGG AGCACT GAAAAAT 
GATGGTTACAGTCAAAATCATCTTCGACAAGCACAGGTGACTCTCATAGACGCTACAACTTGCAATGAACC 
p| TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 
lf% GCCAGGGT GACTCT GG AGGACC ACT GGTTAGT TCAG ATGCTAGAGATATCTGGTACCTTG C TGGAATAGTG 

r| AGCT GGGG AGAT GAATGTGC GAAACC CAACAAGCCT GGTGT TTAT ACT AGAGT T AC GGCCTT GCGGGACT G 

HI GATT ACTT CAAAAACT GGTATC TAAGAG AC AAAAGC CTCATGGAAC AG AT AACATTTTTTTT TGTTTTTTG 

CI GGTGTGGAGGCCAT TT TT AGAGAT AC AG AATT GGAG AAGACT TGC AAAACAGCT AGATTTGACT GATCTC A 

III ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 

GATC AACT CT GT CATCTGT GAGCAATAGT TGAAACT TT ATGT AC AT AG AGAAAT AGAT AATACAAT ATT AC 
AT TAC AGCCT GT AT TC ATT T GT TCTC TAGAAGT T TT GT CAGAAT T T TGACTT GT TGAC AT AAAT TTGT AAT 
GCAT AT AT AC AATT TG AAG CACTCCT TTTCTT CAGTTCCTCAGCT C CT CTCATT TC AGCAAATATCC ATT T 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGT ATTAGGTGTT TTT CTT AGTGGAAT ATT AGAAATGATCAT AT T CATT AT GAAAGGTCAAGC AAAGAC A 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT ATCCTTATTT TC ATTT CCAAAC AACT ACTATG AT AAAT GTGAAGAAGATT CTGTTTTTTT GTGACCT 
AT AAT AAT TAT ACAAACTTCAT GC AATGTACT TGTTCT AAGC AAAT TAAAGC AAAT AT TT ATTT AAC ATT G 
TT AC T GAG GAT GT C AAC ATAT AACAAT AAAAT AT AAAT CACCCA 
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MMYRPDWRARKRVCWE P WVI GLVI FI SL I VLAVC I GLT VHYVRYNQKKTYNY YS T LS FTTDKLY 
AE FGREASNNFTEMS QRLESMVKNAFYKS PLREE FVKS QVIKFS QQKHGVLAHMLL I CRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTAS FGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT T CNE P QAYNDAI T PRMLCAG S LEGKT DACQGDS GGPL VS S DARD I WYLAG 
I VSWGDECAKPNKPGVYTRVTALRDW I TSKTG I 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GT TC TCACT T TAACT GCAGAAACGAGGG TAGAGGAAGCAGTCAT TT TGAC T TAC T T T CCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGT GAAAAGAAATCTGT T GCT TC T TGCATGGTACTT TGGAAGT TTGCTTGTCAT TT TCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACT TTGAAAGCCAGGATGACAAAT TATGGAT TACCTAGATATCGGTGGCT TACTCA 
!«b TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 

AAATGACAGAGATGGAC TGGCCCCCAGAT T CCTGC TGTGT T AGAGAAT TCCCAGGATGT T CCAAA 
J2{ CAGGC CCAC CAGGAAGATCTCAGT GACCTT TAT CAAGAGGGT T GTGGGAAGAAAAT GTAT TCC T T 

T T TGAGAGGAACCAAACAAC TGCAGGT GCTGAGGT TT CT GGGAAT C TCCATT GGGGTGACACAAA 
IF? TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
||| ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
Ij§ GT T GAAACCAAGCC T G T CAAGAAT CT T TGAACACACATCCATGGCAAACAGC T T TAAT ACACACT 

S TTGAGATGGAGGAGTT ATAAA AAGAAATGTCACAGAAGAAAACCACAAACTTGTTTTATTGGACT 
f l TG T GAAT T T T T GAG TACATACT AT GT GT TTCAGAAATAT G T AGAAATAAAAATGT T GC CATAAAA 

If 8 * TAACACCTAAGCATATACTATT CT AT GCTT TAAAATGAGGATGGAAAAGTTTCATGTCATAAGTC 

$ ACCAC C T GGACAATAAT TGATGCC CT TAAAATGCTGAAGACAGATG TCATACCCAC TGTGTAGCC 

f| TGT GTATGACTT TTAC TGAACACAGT TATGT TT T GAGGCAGCATGGT T TGAT TAGCAT T T CCGCA 

1% TCCAT GCAAACGAGTCACAT AT GGTGGGAC TGGAGCCAT AGTAAAGGT TGAT T TACT T CTACCAA 

fj CTAGTATATAAAGTACTAAT TAAATGCTAACATAGGAAGT TAGAAAATACTAATAACTT TTATTA 

C T CAGCGAT CTAT TC T TCTGAT GCTAAATAAAT TATATATCAGAAAAC T TT CAATAT T GGTGAC T 
fll ACCT AAATGT GAT T T T T GCT GGT T ACTAAAATAT TC T TACCACTTAAAAGAGCAAGCTAACACAT 

f| TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTT^AATCTGTATAATTCAGTCGAT 
|p T T CAGT TCT GATAATGT TAAGAAT AACCAT TAT GAAAAGGAAAAT TTGTCCT GTAT AGCAT CAT T 

m ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 

TGTTAT TTAAATACT TAACCACTAATTTTGAAAATTACCAGTGTGATACATAGGAATCATTATTC 
AGAAT G TAG TCTGGTCTT TAG GAAG TAT T AAT AAGAAAAT T T GCACAT AAC T TAG T T GAT TCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCT TAT CTTTGCCTTCT CCAAACAAGAAGCAATAGTC TCCAAGTCAATATAAAT TCTACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 
C T T TGTT TT AT T TCAC T GAT TAAT AT AC TGT GG CAAAT T ACACAGAT TAT TAAAT T T T T T TACAA 
GAGTATAGT AT AT T TAT T TGAAAT GGGAAAAGT GCAT TT TAC T G TAT T T T GTG T AT T T TGTT TAT 
T TC TC AGAATATGGAAAGAAAAT TAAAATGT GT CAATAAAT AT T T TCTAGAGAGTAA 
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MAREDSVKCLRCLLYALNLLFWLM^ 

VMI AVCCFL I IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGWYFTD^^ 

EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MS LKNDNS QHLS CPS VELLKPSLSRI FEHT SMANS FNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CTT T CT C GT GAGAGC CT AGAGGC CTT AAAAAAAAAAGT GCT T GAAAGAGAAG GGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
C CT CT T C ATCAC AGGC ACCT CC GT GT C ACCC GT GGCCCT AGAT CCT T GT T CT GCT T ACAT C AGC CT GAAT GAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
C GCACCT GTCT GGCT CAAT GGCAGCCACCCCCTAGAAGGCGACGGCATT GTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGC GTCT GCT TCCAC GTCT ACT GTGGT CATTTTT ATGACATCT GC GAC GAGGACT GC CAT GG 
C AGCT GC T C AG AT AC C AGC GAGT GCAC ATGC GCT CCAG GAACT GT GC TAGGC CCT GAC AGGC AGAC AT GCT T T G 
ATGAAAAT GAAT GTGAGCAAAACAACGGTGGCTGCAGTGAGATCT GT GT GAACCTCAAAAACTCCTACCGCTGT 
GAGT GT G GGGT T GGC C GT GT GC T AAGAAGT GAT GGCAAGACT T GT GAAGACGTT GAAGGAT GCC AC AAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGAT AACCACACT TGCCAAGTCC CTGT GTT GTGCAAATC AAAT GCCATT GAAGTGAAC ATCCCC AGGGAG 
CT GGTT GGT GGCCT G GAGCT CT T CCT GACC AAC AC CT C C T GC C GAGGAGT GT CC AAC GGC ACC C AT GT C AAC AT 
CCT CT TC T CT C T C AAGAC AT GTGGT AC AGTGGTCGATGT GGT GAAT GAC AAGAT T GT GGCC AGC AACC TCGT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CAT GAGC C GAAAT C ATGGG ATCT TCCC AT TC ACT C T GGAGAT CT T C AAGGAC AAT GAGTT T GAAGAGC CTT ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TT GGAAAGCT T GGT GGAGAGCT GCT T T GCC ACCC C C ACC TCCAAGAT C GACGAGGT CCTGAAAT AC TACCT C AT 
CCGGGAT GGCT GTGT TTCAGAT GACTC GGT AAAGCAGT ACACATCCC GGGATCACCT AGC AAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
C AGGT C AC AGC ACT GCT GAAC AAT GT GGCCT GGGT GGGGTT T C ATC T T T CTAGGGT T GAAAAC T AAAC T GT C C A 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAATCAG AAGCTGGGTATAATAT TTCAAGTTACAAACCCT AGAAAAAT TAAAC AGT T ACT GAAATT AT GA 
C TT AAAT ACCC AAT GAC TC C TT AAAT AT GT AAAT T AT AGTT AT ACCT T GAAATT T C AATT C AAAT GC AGACT AA 
TTATAGG GAAT TT GGAAGT GTAT CAAT AAAACAGTATAT AATTTT 
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m 



MPPFLLLTCLFITGTSVS PVALDPCSAYISLNEPWRNTDHQLDSSQGPPLCDNHVNGEWYHFTGMAGDAMP 
T FCI PENHCGTHAPWLNGSHPLEGDGI VQRQACAS FNGNCCLWNTTVEVKACPGG YYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNT SCRGVSNGTHVNIL FSLKTCGT WDWNDKI VASNLVTGLPKQT PGS SGDFI IRTS KLLI PVT 
CEFPRLYTIS EG Y VPNLRNS PLE IMS RNHG I FP FTL E I FKDN E FE E P Y REAL PTLKLRDSLYFGIE P WH V 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 



Q Inportant features of the protein: 

fit Signal peptide: 

|U amino acids 1-16 



N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 



Hi Tyrosine kinase phosphorylation sites. 

y i 

f*$ amino acids 411-418, 443-451 

Q N-niyristoylation sites. 

Ill amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 

522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCT TGGGGT GACAATCT C AGCTCCAGGCTACAGGGAGACCGGG AGGATCAC AG AGCC AGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTT GTCCTCAT CAAGGTGATTCT GGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGC AGCT GT GT GACGGAGAGCT GGACTGT CC CTTGGGGGAGGACGAGGAGC ACTGTGT C AAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 

H 

^1 AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
m GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 
||| GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
hi TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
Iff TC AGCATCCAGT ACGACAAACAGC ACGT CTGT GGAGGGAGCAT CCTGGACCCCC ACTGGGTCCT C ACGGCA 

H! GCCCACTGCT T C AGGAAACATACCGATGT GTTCAAC TGGAAGGTGC GGGC AGGC T C AG AC AAACT GGGCAG 

CT TC CCATCCCT GGCTGTGGCC AAGATCATCATC AT TGAATT CAAC CCCATGT ACCCC AAAGACAAT G ACA 
' T CGC CCTC AT GAAGCTGCAGT T CCCACT CACT TT CT C AGGCACAGT CAGGCC CATCTGTCTGCCCTT CTTT 

;!! GATGAGGAGCTC AC T CCAGC CACCCC ACTCTGGATC ATTGGAT GGGGCTTTACGAAGC AGAAT GGAGG GAA 

Ut 

!5L GAT GTCTGAC AT AC T GCTGC AGGCGTC AGTCCAGGT C ATTGACAGC ACACGGTGC AAT GCAGAC GAT GCGT 

%J 

H3 ACCAGGGGGAAGT C ACCG AGAAGATGAT GTGTGC AGGCATCC CGGAAGGGGGTGTGGACACCT GCC AG GGT 

ip| GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
||| CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 

GGAAGGCTGAGCT GTAAT GCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGT CAGACACAGAGCAAGAGT CC CCTTGGGT AC ACCCCT CT GCCC AC AGCCTCAGCAT 
TT CT TGGAGC AGCAAAGGGC CT C AATTCCTGT AAGAGACCCT CGC AGCCCAGAGGCGC CC AGAGGAAGTC A 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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MLQDPDSDQPLNSLDVKPLRKPRI PMETFRKVGI P 1 1 1 ALLSLAS 1 1 I VVVL IKVTLDKYYFLCG 
QPLHFI PRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRS TLQVLDS ATGNWFSAC FDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KT PRWGGEEAS VDS WPWQVS I QYDKQHVCGGS I LDPHWVLTAAHCFRKHTDVFNWKVRAGS DKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMS DI LLQASVQVI DS TRCNADDAYQGEVTEKMMCAGI PEGGVDTCQGDSGGPLMYQS 
DQWHWG I VS WGYGCGGP S T PGVYTKVS AYLNW I YNVWKAE L 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCT GC T TAGAC CACCTGGT T TATGTGAC AGGAC TTGCAT TC T CC T GGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACT TGTGCCAAATTATGGGT CAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAATGATCT CCAAGGGCCCT T ATACCCCAGGAGACT T T GAT T TGAATT TGAAACC CCAAAT CCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
C CAGCTAC T CGGGAGGCTGAGACAGGAGAAT TACTTGAACC TGGGAGG TGAAGGAGGCTGAGACA 
GGAGAAT CAC T TCAGCC TGAGCAACACAGCGAGACTC T GT C TC AGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 
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FIGURE lid. 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRK.D 



Signal peptide: 
amino acids 1-15 



a 

a 

m 

w 

m 
m 
y> 

fa 

II! 
O 
fli 
P 

ru 
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FIGURE U5 



CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA 
AGAAT CCT CCAGAGAATTGT GAAGACT GTCACAT TC T AAAT GCAGAAGC T TT TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCT ACAGCAATGGAGAGAAGAAGAAGATT TACATGGAAAT T GAT CC T GTGAC CAGAAC TGAAATA 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAACGGATACACTGG 
CAT CT AC T T CGT GGGTCT T CAAAAAT G T TT TATCAAAACTCAGAT TAAAGTGAT T CCT GAATT TT 
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAACTTTCTTTGAACAGTCAGTG 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GATT T G TGATAACGTGACCATGTAT T GGATCAAT CCCACT CTAATAT CAGT T TCT GAGT T ACAAG 
ACTT T GAG GAGGAGGGAGAAGATCT T CAC T T TCCTGC CAAC GAAAAAAAAGGGAT T GAACAAAA.T 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAAC TT CCAATAAATGACTATACTGAAAATGGAATAGAAT TT GATCCCATGCT GGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACT ACCCATATCCATACTGCTACCAAGGAGGACGAGT CATCTGTCGTGTCAT CATGCCT TG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTC TAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCH I LNAEAFKSKKI CKS LK I CGLVFG I LALTLIVL FWGSKHFWPEVPKKAYDME 
HT FYSNGEKKKI YME I DPVTRTE I FRS GNGTDETLEVHDFKNGYTGI YFVGLQKCFI KTQ IKVI P 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGE DLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENG IE FDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWARMLGRV 



Important features of the protein: 
Signal peptide: 



A amino acids 1-40 

III 

Wl Transmembrane domain: 

Hi amino acids 25-47 (type II) 

iff 

u 

$ N-glycosylation sites. 

O amino acids 94-97, 180-183 

H 

S| Glycosaminoglycan attachment sites. 

Q amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 

fU 242 



N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 



Microbodies C-terminal targeting signal. 

amino acids 315-317 



Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGC TCCCCT CAGGAGCGCGTT AGCTTC AC AC CT TCGGCAGCAGGAGGGCGGCAGC TTCT CGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CC AAAGCC AACATGAC ACTGAC CTCC GG GATC AT GTTC AT TGT C TCAGGTCT TT GTGC AAT TGCTGGAGTG 
TCTGTGTTTGCCAACATGCT GGTGACTAACTT CT GGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGC AGACTGTTCAGACC AGGT AC AC AT TT GGTGCGGC TCT GTTCGT GGGCTGGGT CGCT GGAGGCC 
TC ACACT AAT TGGGGGTGTGAT GATGTGCATCGCCT GCCGGGGCCTGGCACCAGAAGAAACCAACT ACAAA 
ff\ GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
||| TGGGT CC AAC ACCAAAAACAAGAAGATATACGAT GG AGGT GC CC GC AC AGAGGACGAGGT ACAATC TT ATC 

|f I CTTCCAAGCACGACTATGT GTAA TGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
111 CCCAAAAAAC AAGGAG ATCCCATCT AGATTTCT T CTTGCT TT TGACTC ACAGCT GG AAGTTAGAAAAGCCT 

CGAT TTCATCTTT GGAGAGGCC AAATGGTCTT AG CCTC AGT CTC T GTCTCT AAAT ATTCCACCAT AAAACA 
GCTGAGTT AT TT AT GAATTAGAGGCTAT AGCT CACATTT TC AAT CC TCTATT TCT TTTTTTAAATAT AACT 
TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
T T CC T CCT AGT CAAT AAACCCATT GATG AT CT AT T TCCCAGCTT ATCCCCAAGAAAACTTT TGAAAGGAAA 
GAGT AGACCC AAAGATGT T ATT TT CTGCTG T T TGAATTTTGT CTCCCCACCCCC AACTTGGCTAGT AAT AA 
AC AC TT ACTGAAGAAG AAGC AATAAGAG AAAGAT ATTT GT AAT CTCTCC AGC CC ATGATCTCGGTTTTCTT 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA 
AATAAATAAT GT TTT AGT AAAATGAT ACACT ATCTCTGTG AAAT AGCCTCACCCCT AC ATGTGGATAGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
AT GAAAAGCT CACACCTGTAAT CC T AGC ACTT TGGG AGGC TGAGGAGG AAGGAT C ACT TG AGCC C AGAAGT 
TCGAG ACT AGCCT GGGCAAC AT GGAGAAGCCCT GTCTCT ACAAAATACAGAGAGAAAAAAT CAGCC AGTC A 
TGGT GGCAT ACAC CTGTAGT CC CAGC ATTCCGGGAGGCTG AGGTGGGAGG AT CACTTG AGCCCAGGGAGGT 
TGGGGCTGCAGT GAGCC ATGAT CACACC ACT GCACTCCAGCC AGGTGAC ATAGC GAGATCCTGT CT AAAAA 
AATAAAAAAT AAAT AATGGAACAC AGCAAG T C CT AGGAAGTAGGT T AAAACT AATTCTTT AA 
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FIGURE 118 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLLVSIFAL^ 

CAI AGVSVFAE^LVTNFWMSTANMYTGMGGMVQTVQTRYT FGAALFVGWVAGGLTL I GGVMMC I A 
C RGLAPE E T N YKAVS Y HAS GHS VAYKP GG FKAS T G FG S NT KNKK I YDGGART E DE VQ S Y PS KH D Y 
V 



If! 
pi 

* 



Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



a 
m 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CAT CAGGAT GCAGTGCAAAATCTATGATTCCCTGCT GGCT CTT T CTCC GGACCTACAGGCAGC CAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
H CATCACGGGCAT GGT GGTGCT CAT CCC TGT GAGCT GGGTT GCCAATGCCATCATCAGAGATTT CTATA 

ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 

CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 

W 

||l TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
Iff ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
M AAT CTTAAT TACAGGAACTGT GCATCAGCTATTTATGATT CTATAAGCTATTTCAGCAGAATGAGATA 

* TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 

CTCTTTTT ATCATTTAC T TCAAAATGACAT TGCTAAAGACTGCAT TATTTTACTACT GT AATTT CTCC 
ACGACATAGCATTATGTACATAGATGAGTGTAACATTTATATCTCACATAGAGACATGCTTATATGGT 
TTT ATTTAAAAT GAAAT GCCAGTCCAT TACACTGAATAAATAGAACTCAACTATTGCTT TTCAGGGAA 
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
HI TTT ATAAT GAAGATTAAAATGAAGGCT T TAATCAGCATTGTAAAGGAAAT TGAATGGCT TT CT GATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTT GTGTAT TAAATTAACATT TTTAAAACGCAGATAT TTT GTCAAGGGGCTTT GCATTCAAACT GCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATATT T TTGTTTTT GTAT T TGAAGAAGAATGATGCAT TTTGACAAGAAATCATATAT GTAT GGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCA.TCAA.TATAAATAAAAGAGCA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACATT TATATTAAT AAAT TGTACATTT T TCTAATT 



II! 
n 
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FIGURE 120 

MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
I YDSLLALSPDLQAARGLMCAASVMS FIAFMI^ILGMKCTRCTGDNEKV'KAHI LLTAGI I FI I TG 
KWL I P VS WVANAI I RDFYNS I WVAQKRELGEAL YL GWT T AL VL I VGGAL FCC VFCCNE KS S S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 
amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
C TCCGGCAGAGGGAGG T GGT GGACCT GTATAATGGAATGT GCTTACAAGGGCCAGCAGGAGT GCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT TCAAAGGAGAAAAGGGGGAAT G T C TGAGGGAAAGC TT T GAGGAGT CCTGGACACCCAAC T AC 
AAGCAGT G T TCAT GGAGT TC AT TGAAT TATGGGATAGAT C T T GGGAAAATTGCGGAGT GTACATT 
?1 TACAAAGAT GCG TT CAAATAGT GCT CTAAGAGT T T T GT T CAG T GGCTCACT TCGGCTAAAAT GCA 

If! GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
W ATTGAAGCTATAATTTATTTGGACCAAGGAAGCCCTGAAATGAATTCAACAATTAATATTCATCG 

CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
AT TAT TGAAGAACTACCAA& ATAAA TGCTT TAAT TT T CAT T TGCTACC TC T T TT T T TAT TATGCC 
T T GGAATGGTTCACT TAAAT GACAT TT TAAATAAGT T TAT GTATACATC TGAATGAAAAGCAAAG 
C T AAATAT GTT TACAGACCAAAG T GTGAT T T CACACT GT T T T TAAATC TAGCAT TATT CATT T TG 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 



f|J AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT 

TAAAT AAAAAT TATT TCCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKSlQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 



Signal peptide: 

amino acids 1-30 



^ Transmembrane domain: 

£l| amino acids 195-217 



hi 



in 



r! 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT TGCCT GGAAGAATACAT CAT GT TT TT CGATAAGAAGAAAT T GTAGGATCCAGT T TT T T TT TTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCC TAAGCCAAAGCAAAAGACC TAAGGACGACC T TT GAACAATACAAAGGATGGGT T TCAATG 
TAAT TAGGCTAC T GAGCGGATCAGCT GTAGCAC TGGT TAT AGCCCCCACTGTCT TACT GACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCCT T CAAAAAC TT AAGT AT AATCAAT T TAAAGGGCT CAACCAGC TCACC TGGCTATAC 
C TT GACCAT AACCATATCAGCAAT AT TGACGAAAATGCTTT TAAT GGAATACGCAGAC TCAAAGA 
GCT GAT T CT TAGT TCCAATAGAAT CT CCTAT T T TC T TAACAATACC T TCAGACCTGTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCTTTGCTGGCATGATC^GACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTC^AC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
ATCGATGCAG T GAAGAACTACAGCAT CTGTGGCAAAAG TAC TACAGAGAGGT T T GATC TGGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
T GC CC CC GACGGT GGGAGCCAC AGAGCCCGGCC CAGAGACCGAT GC TGACGCCGAGCACATCT CT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAAT GACTCCCAGCACCCAGGAATTT TATG TA 
GATTATAAACCCACCAACACGGAGACCAGCGAGAT GC T GCT GAATGGGACGGGACC CT GCACC TA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGT ATGAA CCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
G G T CAT T T T CC T C TCAT AC ATAAT CAAC C CAT T GAAAT T T AAATAC CACAAT CAAT G T G AAGC T T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
G T TT TAAGATAAAAC T TCT T TCATAGGTAAAAAAAAAAA 
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FIGURE 124 

MGFWIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEI 

LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 

PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 

LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 

SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 

WLKSFKGLRENTIICASPKELQGVWIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPm 

SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 

SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



CCGT TATCGTCT TGCGCTACTGCT GAATGT CCG I'CCCGGAGGAGGAGGAGAGGCT T T T GCCGC TG 
ACCCAGAGATGGCCCCGAGCGAGCAT^ATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCT T TCC CC T GGAT CTCAC AAAAAC TCGACT CCAAATGCAAGGAGAAGCAGCT C T TGC TC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT TGAAGAGGAAGGCT TTCTA7\AGCT TTGGCAAGGAGTGACACCCGCCATTTACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGC CAAT CCAAC T GAC C T AGT GAAG G T T CAGAT GC AAAT GGAAGGAAAAAG GAAAC T GGAAGG 
AAAACCAT T GCGATT TCGTGGTGTACATCATGCATTT GCAAA71ATCTTAGCTGAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
ACCACTTAT GATACAGTGAAACACTACTTGGTATTGAATACACCACT TGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGT GGAGTCAGTCCAT T T TAA 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKR^ 

NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACATCCTCTCCATGT TC TCAGCTAT CCGT TCT CAGCACAGCGG TGT AGACATC 
s 5 TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
« GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
Q AGGAGCGGAATGT GGACGAT GGGCACATCATTAACATCAATAGCATGTC TGGCCACCGAGTGT TA 

ill CCCCT GT CT GTGACCCAC T T CT ATAGT GCCACCAAG TATGCCGT CACTGCGC TGACAGAGGGACT 

W GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 

111 

AGACACAAT TCGCCT T CAAACT CCAC GACAAGGACCC T GAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAG T G T CTCAAAC CCGAGGAT GT GGCCGAGGC TGT TAT C T ACG T CC TCAGCACCCCCGCACA 
CATCCAGATTGGAGACATCCAGAT GAGGCCCACGGAGCAGGTGACCTAGTGACT GT GGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCAC T TC CTGT CCACAC CCCGACCAGGGGCTAGAAAAT T TGT T TGAGAT TT TTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
Q TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
f|| TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 

TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
C AG AAC AC C AGG GC C T GG C C CAG T GG AT T T CAT GG T GAT CAT T AAAAAAG AAAAAT CG C AAC C AA 
AAAAAAAAAA 



II! 
h 

m 
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FIGURE 128 



MARPGMERWRDRLALVTGASGG I GAAVARALVQQGLKWGCARTVGN IEELAAECKSAGYPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGIJ^PDTLLSGSTSGWKDMFNVWLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 




N-myristoylation sites. 



amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 



199-205 




Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 



III 
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FIGURE 12Q 



AAC'JL'T CT ACATGGGCC TCCT GCTGCT GGTGC TC T TCC TCAGCCTCC T GCCGG TGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGT T GAGT GCT T GAAATGAGGAACT GAGAAAAT TAAT TT C TCAT GTAT TT TTC TCAT T TAT TTA 
TTAATT TT TAAC TGATAGTT GTACATATT TGGGGGTACATGTGATATTTGGATACATGTATACAA 
TATATAATGATCAAATCAGGG T AACT GGGATAT CCAT CACATCAAACAT TTAT T T T T TAT T C T T T 
U TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 

CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 

if! 

r | GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
|f| CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
HI GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
?* k ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
L ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
js| AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
fl AG TAAGAAAATGCAATAT T T GT CT TT C TGT GCCTGGCT TAT T TCAC TTAACATAAT GAC T TCCTG 

ill T T CCAT CCAT GT T GC T GCAAAT GACAGGAT T TCGTTC T TAAT TTCAAT TAAAATAACCACACATG 

M GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVARSHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 



App_ID=10063551 



Page 280 



FIGURE lai 



TT CT GAAGT AACGGAAGCT ACCTT GT AT AAAGACCT C AAC ACTGCT GACC ATGA TC AGCGCAGCCTGGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
T T GAAAAACT TGCT GAAAGT AGAAAG AAT ATACC T ATACC ACAAC AGT TT AGAT GAATTTCC TACC AACCT 
CC CAAAGT AT GT AAAAGAGTT ACAT TTGCAAGAAAATAACAT AAGGACT ATCACT T ATGAT T CACTTTC AA 
AAAT TCCCTATCT GGAAGAAT T ACATTT AG ATGAC AACTCTGTCTCTGCAGT T AGC AT AG AAGAGGGAGC A 
g ; TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
p.% GCCC AGGACT AT AGAAGAACT ACGCTTGGATGAT AATCGC AT ATCC ACT AT T TC ATC ACCAT CT CTTC AAG 

p| GT CT C ACT AGTCTAAAACGCCT GGTTCT AG AT GGAAACCT GT TGAACAAT CATGGTTT AGGT GACAAAGTT 

|f! TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
p| T CCAGGC ACAAACCTG AGGAAGCT T T ATCTTCAAGATAACCACAT C AATCGGGT GCCC CC AAAT GC TT TTT 

III CT TATCTAAGGCAGCTCT AT CGACT GGATATGTC CAAT AAT AACCT AAGT AAT T T ACC TCAGGGT ATCTTT 

If I GATGATTTGGACAATATAACACAACTGATTCTTCGC AACAAT CCCTGGTATT GC GGGTGCAAGATG AAATG 

s GG TACGTG AC TGGT T AC AATCACT ACCT GTGAAGGTC AACGT GCGTGGGCTCAT GTGCCAAGCCCC AG AAA 

^ AGGT TCGTGGGAT GGCTATT AAGGATCT CAATGC AGAACTGT TT GATTGT AAGGAC AGTGGGAT TGT AAGC 

s « ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 

ul 

C AAAC AGCCAGATAT T AAGAACCCCAAGCT CACT AAGGAT CAACAAACC ACAGGGAGTCCCT CAAGAAAAA 
«|. CAAT T AC AAT TACT GT GAAGTCT GTC ACCT CT GAT ACCAT TC AT ATCT CTTGGAAACT TGCT CT ACCT AT G 

pi ACTGCTTTGAGACT CAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGAT CT AT AACAGAAAC AATTGT 

f|| AACAGGGG AACGCAGT GAGT ACT T GGTC ACAGCC CTGGAGCCT GAT TC ACCCTAT AAAGT AT GCATGGTT C 

CC AT GGAAAC CAGC AACCTCTACC T ATTT G AT GAAACTCCTGT T TGTAT TGAGACTGAAACT GC AC CCCT T 
CGAAT GTACAACCCT ACAACCACC CTCAATCGAGAGCAAG AGAAAG AACCT T AC AAAAACCC CAAT T TACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
T T CAT AGGAAT GGATCGCT C TT CT C AAGGAACTGT GC ATATAGC AAAGGG AGGAGAAG AAAGGAT G ACT AT 
GCAGAAGCTGGC AC T AAG AAGGAC AACT CT AT CC T GGAAATC AGGG AAACT T CT TTTC AG AT GT T AC CAAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
ACAAAAAC AAT C ACAGTG AAAGCAGT AGTAACCGAAGC TACAGAGACAGT GGTAT TCCAGAC TC AGAT CAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 132 



MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNTOGIMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDS PYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
P PYKNPNLPLAAI I GGAVALVT I Al LALVCW YVHRNGS L FS RNCAYSKGRRRKDD YAEAGTKKDNS 

|f! I LE IRETS FQMLP I SNEP I SKEE FVIHT I FPPNGMNLYKNNHSE5 SSNRSYRDSGI PDSDHSHS 



U 



is 



Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
HI amino acids 531-552 

N-glycosylation sites* 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-4 4 
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FIGURE laa 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTC ATG TCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
p| CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
HI GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 

CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACT CCAACAGAGCCAGGACTAT ATCAACC TCTTCTGCGC CAACATGATGGAC TT GAACCGC 
AGAGCTGAGGCCATCGGATACGCC TACCC TACCCGGGACAT CT T CAT GGAAAAC AT CATGT TC TG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGAT GCT T C GGGGAGCCTGATGC TGAAGATGAAGAATTAT C TAAAGCTAT T CAA 
TAT CAGCAGCAT T T T T CGAGGAGAG T GAAGAGGCGAGAAAAACAATT TCCAGAT TC TCGCTC TGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCT GAAAAGCAC T T TCAAAGAGACTGT GT TGAATAAAGGGC CAAGG T TCT TGCCACCCAGCAC TC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
G T GG C T T C C TATACAT CC T G GCAGAATAC C C C C CAG C AAAC AGAGAGC CACACC CAT C CACAC CG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAMA 



5 , S 
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FIGURE 134. 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINL FCANMMDLNRRAEAI GYAYPTRD I FMEN IMFCGMGG FS DFYKLRWLEAI LSWQKQQEGC FG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE ias 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTC'X'TTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCT TACAGAT TTGATCCCGT T CGAGTGGATATCACT TC GAAAGGAAAAATGAGAGCAAGATATG 
TGAAT TACATCAAAACATCAGAGGTTG TCAGAC TGCC CT AT CC TC TCCAAATGAAATC T T CAGGT 
C CACC T T C T TAC T T TAT TAAAAGG GAAT C G T GGGGC T GGAC AGAC T T T C TAATGAACC CAAT GG T 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
Zl : C TGACATGAGACGGGAAATGGAGCAGTCAATGAATAT GCTGAAT TCCAACCATGAGT T GCCT GAT 

J! 

if G T TTC T GAGTTCATGACAAGACTCTT C TC TTCAAAAT CATC TGGCAAAT CTAGCAGCGGCAGCAG 

If TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACT GGGTGGCAT CCAAGTC TTGGAAAACCG T GT GAAGCAAC TACTATAAAC TT 
GAGTCATCC CGACGT TGATC TC T TACAACTG TG TATGTT 
p| AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
|f! GAGG T CAAT AT T GATGTCACTGAAT TAAT TACAGTGT CCTATAGAAAATGCCAT TAAT AAAT TAT 

Cl ATGAACTAC TATACATTATGTATAT TAAT TAAAACAT CT TAAT CCAGAAATCAAAAAAAAAAAAA 

W AAAAAAAAAAAAAAA 

rH 



in 

f =,- 
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FIGURE 136 



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHD I PS GS YWEWS PAYRFDPVRVD ITS KGKMRARYVNY I KT SE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFIJOTPMVMMM^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 



||! Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 



Ws Glycosaminoglycan attachment sites. 



amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 



H AAGCT GTGT GAT CGCCAC AAACC T TC AGGAAATACGAAATGGAT TT TCT GAGATAC GGGGCAGTG 

W TGCAAGCCAAAGATGGAAACAT TGACAT CAGAATCT TAAGGAGGACTGAGT CT TTGCAAGACACA 

IS AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 

CP 

I g I AAACTACCAGAC CCCT GACC AT TATAC T C T CCGGAAGATCAGCAGCCTCGCCAAT T CCT T TCT TA 

Iff CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 

111 AAGAAATACAGCCAGAT TC T GAG TCACT TTGAAAAGC T GGAACC TCAGGCAGCAG T TGTGAAGGC 

u 

TT T GGGGGAACTAGACATT CT TCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAG TGATGC TG 

JL CT GCT AAGAATAT TCGAGGT CAAGAGC T CCAGTCT TCAATACC TGCAGAGGAGGCATGAC CCCAA 

|S ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 

fff C T T CCTT GCATGATTG TC T T TAT GCATCCCCAAT C TT AATT GAGACCATACT TGTATAAGAT TT T 



TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
ATGTAT T TAT T T T T T TAC T T GGACATGAAAC TT TAAAAAAAT T CACAGAT TATAT T TATAACC TG 
ACT AGAGCAGGTGATG TAT T T TTATACAGTAAAAAAAAAAAACCT T GTAAATTC TAGAAGAGTGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATC TACCT T ACAAT TACTGACCAT CCCCAGTAGAC TCCCCAG TCCCATAAT TG T GT ATC T TCCAG 
CCAGGAATC CTACACGGCCAGC AT GT AT TTC TACAAATAAAG TT T TCT T T GCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 128 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET 
E 



Important features of the protein: 
J..,, Signal peptide: 

pi amino acids 1-42 

111 

4y cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

£ 3 i 

M 

N-rnyristoylation sites. 

Q amino acids 42-47, 46-51, 136-141 

i 



m 
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FIGURE 13Q 



in 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCAC CT CT CT T CAGTAAAGT T G TTAT TGT TCTGATAGAT GCCT T GAGAGAT GAT TTTGTGTTTG 
GG T CAAAGGGTGTGAAATT TAT GC CC TACACAAC TTACCT T GT GGAAAAAGGAGCATC TCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGC T GGAAAAAGAATAG T CT T T T AT GGAGAT GAAACCT GGGT TAAAT TA 
T TCCCAAAGCATT TT GTGGAATATGATGGAACAACCT CAT T T T TCG T G TCAGAT TACACAGAGGT 
U GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 
PI TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
S AAGCT GAGC GAGATGGACAGCGTGCT GATGAAGATCCACACCT CAC T GCAGT CGAAGGAGAGAGA 

GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
IP"! GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
III CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
If! ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 

CAATGAGAGAGCAGT TGAGAT T T TTACATT TGAATACAGT GCAGCT TAGTAAACTGTT GCAAGAG 
AAT G T G C C G T CAT A T GAAAAAG AT CCTGGGTTT GAG C AG T T T AAAA TGT C AG AAAG AT T G CAT GG 
^ GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
s. TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
Q. TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
m CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
%l CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
If GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
I'll AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
p TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
fll CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 

TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C TGAC CGAGACACTCACAGCT T T GTCATCAGGGCACAGGC T TCC TCGGAGCCAGGATGAT C T GTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C TGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAAT TGAGAAAAAAAA 
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FIGURE lao 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKraPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDI L ILHYLGLDHI GH I SGPNS PL I GQKLSEMDS VLMKI HTS LQSKERET PLP 
NLL VLCGDHGMS E TGS HGAS S TEEVNT PL I L I S SAFE RKPGD I RHPKH VQ 



m 



I? 3 



3 



Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 



Cell attachment sequence. 

amino acids 205-208 
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FIGURE lai 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAAT T CAGCATTCATGACCAGGATCACAAAGTACT GGT CCT GGAC TCT GGGAATC TCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCT TCT TT GCAT TAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT TGGGGTGACAGATAAAT TTGAGAACAGGAAACACAT T GAAT T TTCAT TTCAACCAGT T TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGAT TAGG AAACTGCCCCATTGAACGCCTTCCTCGCTA 
ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT 
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FIGURE 142 

MLLLLLE YNFP IENNCQHLKTTHT FRVKNLNPKKFS IHDQDHKVLVLDSGNL I AVPDKNY I RPE I 
FFAIJ^SLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWmLESAMiPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cQ4P-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-rayristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 



CTAGAGAG T ATAGGGCAGAAGGAT GGCAGATGAGT GACT CCACATCCAGAGCTGCCTCCCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
N CACAT CT TGAAT TC CTGGAAGGAGCAGGTGGAG TCCAAGACT GTAT TC TCAATGGAGCTAC T GC T 

W GGGGAGAACTAGGTGT GGGAAAT TTGAAGACGACAT T GACAACTGCCAT TTCCAAGAAAGCACAG 

AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
M C TCCT GAACAAGACCTGC T TGGAGGGAT TCCAC TGAGTGA&AC CCACTCACAGGCTTGT CCATGT 

|ff GCT GCTCCCACAT TCCGTGGACAT CAGCAC TAC TCTC CT GAGGAC TC T TCAG T GGCTGAGCAGCT 

HI TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
*** ACACATCTTGAGCCTAATCATGTAGT GTAGATCATTAAACATCAGCATTT TAAGAAAAAAAAAAA 

m AAAAAAAAAAAAAAA?^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

in 



til 
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FIGURE 144 



MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHSQRDCDEHOTMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMT QFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 



01 



C;5 



N-glycosylation sites. 

amino acids 117-121, 139-143 




N-rayristoylation site. 



amino acids 9-15 



p 

II! 
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FIGURE las 



C IGT GCAGCTCGAGGC TCCAGAGGCACAC ICCAGAGAGAGC CAAGGT T CTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACT T CGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACT GGCAG T TCCCCGAT GGCAT CCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACC TGCT T CAC AGTACT T CCCAACAACT C TTAGAGGTAG 
G TGTAT TCC CGT T T TACAGATAAGGAAAC TGAGGCCCAGAGAGC T GAAGTAC TGCACC CAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
C TCTAAT GAAATTGTGAAAGCT CCATG TT TAGAAATAAATGAAAACACCT GA 
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FIGURE 146 

MRKHLS WWWLATVCMLL FSHLSAVQTRG I KHRI KWNRKALPS TAQI TEAQVAENRPGAFI KQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQi^QGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CXGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAAC T T TCAGAT TCAGGGGGTACATGTGAAGGT TTGT TT TATGAGTATAT TGCATGATGCTGAGG 
TTTGGGGT 



W 

III 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

fw\ amino acids 62-68 

ii 

Ill 

If! 

fll 
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FIGURE 14Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTGTACATAATAGAGAATT AAGTGCAGAAAGACCT TT GAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGT T GATAACTT GAAC CTGCT AAAGGCAATAACAGAAAAGGAAAAAATT GAGAAAGAAAGACAATCTA 
TAAGAAGC TCCCCACTT GATAATAAGT TGAAT GTGGAAGAT GTTGAT TCAACCAAGAAT CGAAAACTG 
ATC GATGATTAT GACTCTACT AAGAGT GGAT TGGAT CATAAATT T CAAGATGATCCAGATGGTCTTCA 
T CAACTAGACGGGACT CCT T T AACCGC TGAAGACAT T GT C CAT AAAAT C GCT G CC AGGAT T TAT GAAG 
AAAAT GACAGAGCCGTGTTTGACAAGATT GT TT CTAAACTACTTAATCT CGGCCTTAT CACAGAAAGC 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAAT TAATCT CAAAGGAAGCCAACAA 
TTAT GAGGAGGATCCCAATAAGCCCACAAGCTGGAC TGAGAATCAGGCT GGAAAAATACCAGAGAAAG 
TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
TTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA 
ATATTTCCCAAATTT CTATGCGCTACT GAAAAGTAT TGAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACTGATTACTATCATGAAAACACTGATTGACTTTGTGAAGATGATGGTGAAATATGGAACAATATCT 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAA.GATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATT TAT AGCAGCCTGTAAAAATGGCAAAAGAT CCAGGAGTCTTTCAA 
CTGT TTCAGAAAACATAATAT AGCTTAAAACACTTCTAAT TCT GT GATT AAAATTTTTT GACC CAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 15Q 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKTKKTYPPENKPG 
QSNYS FVDNLNL LKAI TEKEKIEKERQS I RS S PLDNKLNVEDVDS TKNRKL I DD YDS T KSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETL I T IMKTLIDFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLE AI RKNI E WLKKHDKKGNKEDYDLSKMRDF INKQADAYVEKG I LDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 
220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites : 

amino acids 143-148, 239-244 
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FIGURE isi 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
G ATGG TCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGC TTCTAGCTGGAGGGCTGCATGCAGGGAAGGT CAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAG T GCC TG TCAT GTGGGGTGGGGCAGGAGCC GACT CTAACACTAGAGC CAGT GAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGAC TAG GGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
5 , TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
!Z CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
p CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
O TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
ff| GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
71 GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
JJj TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
W I GAAAT GGC TCGAGC TCAGAAGATAAAAGATAAG TAGGGTAT GCTGATCCTCT TT TAAAAACCCAA 

||1 GATACAATCAAAATCCCAGATGC T GGTCTCTAT TCCCAT GAAAAAG TGCT CATGACATATTGAGA 

y* AGACCTACTTACAAAGTGGCATATATTGCAATTTATTTTAATTAAAAGATACCTATTTATATATT 
* TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
J AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
W? AT T CCT TG TATAAAAATAAGAAAAGAAAT TAATCT TGAGGTAAGCAGAGCAGACATCAT CT CTGA 

iff TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
Q TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCAC T GAGGAGGAGGCTGT GCT GAGT TTGT 

p| GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
IJt GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
O AT T GT GT CCCCC TCAAAT TCACAT CC T T C T TGGAAT CTCAGT C TGTGAGT T TAT T TGGAGATAAG 

III GTCTCTGCAGATGTAG T TAGT T AAGACAAGGT CATGCTGGAT GAAGGTAGAC CTAAAT TCAATAT 

GACTGGT T TCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGAT GAAGGCAGAGATCGGAGT T T T GCAGCCACAAGCTAAGAAACACCAAGGAT TG TGGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAAT TCT TCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATAT GGCAGAAG TGATG GCAT GCCAC TTCCAAGAT TAGGT TATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGC T AGC TGCCATGCTAT GAGCAGGCC TATAAAGAGAC T T AC G TGGTAAAAAAT GAAG TCT CC T 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCT GTGAGATAATCGATGT TTGT TGT TTT 
AAGT T GC TCAGT T T TGGTC TAACT T GTTAT GCAGCAATAGATAAATAATAT GCAGAGAAAGAG 



App_ID=10063551 



Page 301 of 3 18 



FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQLIAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAP I TDFYFQQCD 

N-myristoylation sites* 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 

C TT CAGAACAGGTTCTCC T TCCCCAGTCACCAG TT GCTCGAGT TAGAAT TGTCT GCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT T GAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGC TGAAGGACACAGT GAAAAAGC 
TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT 
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAAT TAGATGCCCCAAAGCGATTT TT TTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGAC T T TCTAAGCATAGATAT T TAT TGAT AACATT TCAT TGTAACT GGT GT TC 
TATACACAGAAAACAATTTATTTTTTAAATAATTGTCTTTTTCCATAAA?^GATTACTTTCCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
TGTAT T TAT TAT TAT TATAAGACTGCATT T TAT TTAT AT CAT T T TAT TAATATGGATT TATT TAT 
AGAAACAT CAT TCGATATT GCTACTT GAG TGTAAGGCTAAT AT TGATATTTATGACAATAAT TAT 
AGAGC TATAACAT GT T TAT TTGACCT CAATAAACACT T GGATATCCC 



FIGURE 154 

MAALQKSVSSFLMGTIATSCLLLLAL^ 

ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 



Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

!! amino acids 54-58, 68-72, 97-101 



m 



m N-myristoylation sites. 

r H amino acids 14-20, 82-88 



55. 



ru 



f!S 



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE as 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCC GAC T TGTGACTGAGT GTGCAGTGCCCAGCAT G TACCAGGT CAGT GCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGAT TAGG TGAGGACAGT TC TC TCATTAGCC T T T T CC TACAGGTGG TTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 

If i 

|s| GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
III AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 

W GTGATGGGC TAG CCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 

Lis 

ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 

« CAGGATCCC GGGACAGGATGGGGGGC T T TGGGGAAAACCT GCACT TCTGCACATT T TGAAAAGAG 

tjf CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 

U GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 

{I! GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 

ji* TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 

8 41 

TTGACAT GGATGATT CTGAGGAGGAAGCTGT TAT T GAATGTAT AGAGAT T TATCCAAATAAATAT 
C T T TAT T TAAAAATGAAAAA 
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FIGURE 1*6 



MRERPRLGEDSSLISLFLQVVAFIJ^MVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 



N-glycosylation site. 

amino acids 136-140 

w 

111 Tyrosine kinase phosphorylation site. 

amino acids 127-135 

pi N-myristoylation sites. 

||! amino acids 44-50, 150-156 



III 
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FIGURE 1S7 

CCGGCG ATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGT T CAAT G T GGC TC TGAAACTGGGCCATC TCCAGAG T GGATGCTACAACATGAT CT AAT CC 
CCGGAGAC T TGAGGGACC TCCGAGTAGAACCTG T TACAAC TAGTGT TGCAACAGGGGACT AT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATT T GT GT GACGGGCAAAAGCAACT T CCAG TCC TACAGCT GTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCT GAACACAGTC TAT T T CAT T GGGGCCCATAATAT TCCTAAT GCAAATATGAATGAAGATGG 
N CCCT T CC AT GTC T GTGAAT T TCAC CT CACCAGGC TGC CTAGACCACATAAT GAAATATAAAAAAA 

Si AG TGT GT CAAGGCCGGAAGCCTG T GGGATCCGAACAT CACT GCT T GTAAGAAGAAT GAGGAGACA 

HI G TAGAAGTGAAC T TCACAACCACT CCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 

||| TATCATCGGGT TT TCTCAGGTGT TTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 

10 TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
J* 1 AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 

GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
p CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
111 TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 

1*1 TCACACAAT TTGT TAC T TCACTGAAT T TC T TCAAAACCAT T GCAGAAG TGAGGT CATCCT T GAAA 

I?! I 

!.:f AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 

?*a GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 

III 

GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
G TGAT CT AAGAAGCCAGAT T CAT C T G C AC AAAT AC G T GGTGGT CTACT T TAGAGAGAT T GATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 

MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NAlsnsINEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKN 

STIIGFSQVFEPHQKKQTRASVVIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKBerSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 

GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AGCCACCAGCGCAAC ATGA CAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
AT CAT CAAT GAAAACC AG CGCGTTTC CAT GTCAC G TAAC AT CGAGAG C CG C T CC AC CT CC C CC T G 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GG T GACT G T TGGCTGCACCT GC G TCACCCCTGTCATCCACCAT G T GCAGTAAGAGGTGCATAT CC 
AC T CAGCTGAAGAAG 
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FIGURE 160 

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-rayristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 

ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
AGAGAGAGTGCACAAAC TACCCAGCACAGCCCC CTCCGC CCCC T CTGGAGGC TGAAGAGGGATT C 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGT TTGGAGGAGCAGCTGACT CAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
H TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
C3 GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
n ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
Si GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
f I CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
W CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
HI CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
|f| GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 

TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
« CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
C3 GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
II! CAAGACCT GCAG TCAGGC CAGT GT CT GCAGC TATGGGACGATGACT T GGGAGCGCT AT GGGCCTG 

m CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
JfJ CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
IW CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
13 CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
f|| CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 

CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAMjVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEE^LNHTQQLPALPWIJNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHE^NLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENWVDKVTLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWAC PMDKY I H KRWALVWL AC LL FAAALS L I LLLKKD HAKGWL RLLKQDVRSGAAARG 
RAALLL Y S ADDS G F E RLVGALASALCQL PLRVAVDLW S RRE L S AQG P VAW FH AQ RRQTLQEGGVWLL F S P 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-58 6 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myi:istoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 162 



GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CCT CAAGCCACC TGATGTGACCTGTATCTCCAAAGTGAGAT CGAT TCAGATGAT TGT T CATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TCTACCACT TAGAGCTCCAGGTCAACCGCACC TACCAAAT GCACC T T GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
p% CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
Jit TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
p AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
III TCCACAGCGGCATAGCCT G T CCGAGAT CACCTAC TTAGGGC AGCGAGACATC TC CATC CTCCAGC 

If] CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
[n GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
a " CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
* CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
0 TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
ill CCCTGGGGATTTGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
*■» CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
III TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
© GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
Iff AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAAT GACAGTGCAAGGAGGAAAT GCAGGGAAAC TCCCGAGGTCCAGAGCCCCACC TC 
C TAACACCATGGATTCAAAGTGCTCAGGGAATT TGCCTCTCCT TGCCCCAT TCCTGGCCAGT T TC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAAT GCAGGTC CACCAGGGAGGGAGACACACAAGCC T T T T CT GCAGGCAGGAGT T TCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE l6d 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRI TRKS CNL TVET GNLTEL YYARVTAVSAGGRSATKMTDR FS S LQHT TLKP PDVTCI S 
KVRS I QMI VHPTPT P IRAGDGHRLTLED I FHDLFYHLELQVNRT YQMHLGGKQRE YE FFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSIAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AG T G T TGAACACAG T T TT GGATAT TT TCCAAAAGAT T TGAT CAAGGTACTTCATAAAT ACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACT T T GT CTGCT T TGAAGGAGGAAGAGATGAT T 
T TAATAGT T ATAATGTAGAAGAGC T T T TAGGATCT T T GGAACTGGAGGAC TC TGTACC TGAAGAG 

TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 

Q 

i f% ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 

III CAGAGAGCACCGAGGGGC TGCAGGGACAGCC CT CAGCTC AGGAGAGCCACCC TCAC ACCAGCGGT 

HI CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 

III AT T GAAAG T GCC GGGAAGCGAAAGCAGAAC TGGCAAT AGT T CTCCTGCC TCGG TGGAGCGGGAGA 

" AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 

H«j CATTACAGCAAAGGAT T TCGT T GGCATCAAAATC TAAG TT T GTT T TACAAAGATTGT TT T TAGTA 

|ff CTAAGCTGCCTTGGCAGTTT GCAT TT T TGAGCCAAACAAAAATATATTATTT TCCCTTCTAAGTA 



AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHS FGYFPKDLIKVLHKYTEEELHI PADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKT DAYKVLKTEMS QRGS GQC VI H Y SKG FRWHQNL S L FYKDC F 



p "2 



pi amino acids 67-7 6 



Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 



N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Amidation site. 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCT T GAGAATGAAGCAGAACAGAAGT TAATAGAGAGCATGT T GCAAAAC CTGACAAAACCCGGG 
U ACAGGGAT TTC TGATGGTGAT T TCT GGATAGGGCTT T GGAGGAATGGAGAT GGGCAAACAT CTGG 

TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
SS ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
C TT GGGGGTCCC TACC TT TACCAGTGGAATGAT GACAGGTGTAACAT GAAGCACAAT TAT AT T T G 
ill CAAG T ATGAACCAGAGATTAAT CCAACAGCCCC TGTAGAAAAGCCT TATC T TACAAAT CAACCAG 

111 GAGACACCCATCAGAATGT GG T T GT TACTGAAGCAGGTATAAT T CCCAATCTAAT TTATGT TGT T 

in ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
IP GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGT CTACACT GT GGAT TTCAAAGAG TA 

CCAGAA&AGAAAGT GGCATGGAAGTATAATAAC TCAT TGACTT GGT TCCAGAAT T T TGTAAT TCT 
N 5 GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAAT GGCT TGAAATCACAAAGGATCTGC 

t AAGAT GAACT GTAAGC TCCCCCT T GAGGCAAATAT TAAAG T AAT T TT TATAT GT C TAT TAT T TCA 

H T TT AAAGAATATGCTGTGCTAATAATGGAGTGAGACAT GCT T AT T T TGCTAAAGGATGCACCCAA 

ACT TCAAACT T CAAGCAAATGAAAT GGACAAT GCAGATAAAG T T GTTATCAACACGTC GGGAG TA 
JJ.* TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
**l T G T AT ATT GTAT TGAAAT TT ACAGTGTGCAAAAGTAT T T TACCTT TGCATAAGT G T T T GATAAAA 

fll ATGAAC TGT TC TAATATT TATT T T TAT GGCATC TCAT T T T T CAATACATGCTCT T T TGAT TAAAG 

P% AAAC TTAT T ACTGTTG TCAACT GAAT TCACACACACACAAATATAGTACCATAGAAAAAGT T T GT 

m TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 

AAT AAGAAGC TAT T TCAT TAAGT GTGATATAAACC T CCT CAAACAT T T TACT TAGAGGCAAGGAT 
TGTCTAAT TTCAATTGTGCAAGACAT GTGCC TTATAATTATT TT TAGC TTAAAATTAAACAGATT 
T TGTAAT AATGTAACT T TG T TAAT AGG TGCATAAACACT AAT GCAGTCAAT T TGAACAAAAGAAG 
TGACATACACAATATAAATCATAT GT CTTCACACGTTGCCT ATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACA^AGATGGTTGTTCGGGGTTTGGG 
AT TGACACT GGAGGCAGATAGT TGCAAAGT TAGT C TAAGGT T T CCC TAGC T G TAT T TAGC CTCTG 
AC TAT AT TAG TATACAAAGAGGT CAT GTGGT TGAGACCAGGTGAATAG TCAC TAT CAGTGTGGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAAT GGGT TGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQISrWVTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 2 66-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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